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NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
HELICOBACTER PYLORI AND VACCINE COMPOSITIONS THEREOF 

Background of the Invention 
5 Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that 

was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and 
B. Marshall, (1983) Lancet J_: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25: 
83-88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer 
disease. (Rathbone et. al., (1986) Gut 27: 635-641). Moreover, evidence is 

10 accumulating for an etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer 

disease, and gastric adenocarcinoma. (Blaser M. J., (1993) Trends Microbiol. I : 255- 
260). Transmission of the bacteria occurs via the oral route, and the risk of infection 
increases with age. (Taylor, D.N. and M. J. Blaser, (1991) Epidemiol. Rev 13' 42-50). 
H. pylori colonizes the human gastric mucosa, establishing an infection that usually 

1 5 persists for decades. Infection by H. pylori is prevalent worldwide. Developed 
countries have infection rates over 50% of the adult population, while developing 
countries have infection rates reaching 90% of the adults over the age of 20. (Hopkins 
R. J. and J. G. Morris (1994) Am. J. Med 97: 265-277). 

The bacterial factors necessary for colonization of the gastric environment, and 

20 for virulence of this pathogen, are poorly understood. Examples of the putative 
virulence factors include the following: urease, an enzyme that may play a role in 
neutralizing gastric acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475; 
Ferrero, R.L. and A. Lee (1991) Microb. Ecol. Hlth Dis. 4: 121-134; Labigne et al., 
(1991) J. Bacteriol 173: 1920-1931); the bacterial flagellar proteins responsible for 

25 motility across the mucous layer. (Hazell et al., (1986) J. Inf. Dis. 153: 658-663; Leying 
et al., (1992) Mol Microbiol 6: 2863-2874; and Haas et al., (1993) Mol Microbiol. 8: 
753-760); Vac A, a bacterial toxin that induces the formation of intracellular vacuoles in 
epithelial cells (Schmitt, W. and R. Haas, (1994) Molecular Microbiol. 12(2) : 307-319); 
and several gastric tissue-specific adhesins. (Boren et al., (1993) Science 262 : 1892- 

30 1895; Evans et al., (1993) J. Bacteriol 175: 674-683; and Falk et al., (1993) Proc. Natl 
Acad. Sci. USA 90: 2035-203). 

Numerous therapeutic agents are currently available that eradicate H. pylori 
infections in vitro. (Huesca et. al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. 
G. Morris, supra). However, many of these treatments are suboptimally effective in vivo 

35 because of bacterial resistance, altered drug distribution, patient non-compliance or poor 
drug availabilty. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics 
combined with bismuth are part of the standard regime used to treat H. pylori infection. 
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(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics J_5 Supp. 
q : 37-48). Recently, combinations of a proton pump inhibitors and a single antibiotic 
have been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. 
Dominguez-Munoz supra). However, methods employing antibiotic agents can have the 
5 problem of the emergence of bacterial strains which are resistant to these agents. 

(Hopkins, R. J. and J. G. Morris, supra). These limitations demonstrate that new more 
effective methods are needed to combat H. pylori infections in vivo. In particular, the 
design of new vaccines that may prevent infection by this bacterium is highly desirable. 

10 Summary of the Invention 

This invention relates to novel genes, e.g., genes encoding polypeptides such as 
bacterial surface proteins, from the organism Helicobacter pylori (H pylori), and other 
related genes, their products, and uses thereof. The nucleic acids and peptides of the 
present invention have utility for diagnostic and therapeutics for K pylori and other 

1 5 Helicobacter species. They can also be used to detect the presence of H pylori and 
other Helicobacter species in a sample; and for use in screening compounds for the 
ability to interfere with the H pylori life cycle or to inhibit H. pylori infection. More 
specifically, this invention features compositions of nucleic acids corresponding to 
entire coding sequences of H. pylori proteins, including surface or secreted proteins or 

20 parts thereof, nucleic acids capable of binding mRNA from H. pylori proteins to block 
protein translation, and methods for producing H. pylori proteins or parts thereof using 
peptide synthesis and recombinant DNA techniques. This invention also features 
antibodies and nucleic acids useful as probes to detect H pylori infection. In addition, 
vaccine compositions and methods for the protection against infection by H. pylori are 

25 within the scope of this invention. 

Detailed Description of the Drawings 

Figure 1 is a bar graph that depicts the antibody titer in serum of mice following 
immunization with specific H pylori antigens. 
30 Figure 2 is a bar graph that depicts the antibody titer in mucous of mice 

following immunization with specific H. pylori antigens. 

Figure 3 is a bar graph that depicts therapeutic immunization ofH pylori 
infected mice with specific antigens dissolved in HEPES buffer. 

Figure 4 is a bar graph that depicts therapeutic immunization of H pylori 
35 infected mice with specific antigens dissolved in buffer containing DOC. 

Figure 5 is a graph depicting the activity of recombinant PPIase. 
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Figure 6 is a graph depicting PPIase activity in an H. pylori extract. 
Figure 7 is a graph depicting a decrease of glutamate racemase activity with L- 
Serine-O-Sulfate. 

Figure 8 depicts the amino acid sequence alignment in a portion of the sequence 
of 12 H. pylori proteins (depicted in the single letter amino acid code and designated by 
their amino acid Sequence ID Numbers; shown N-terminal to C -terminal, left to right). 

Figure 9 depicts the N-terminal portion of nine H. pylori proteins (depicted in the 
single letter amino acid code and designated by their amino acid Sequence ID Numbers; 
shown N-terminal to C -terminal, left to right). 



Detailed Description of the Invention 

In one aspect, the invention features a recombinant or substantially pure 
preparation of H. pylori polypeptide of SEQ ID NO: 492. The invention also includes 
substantially pure nucleic acid encoding an H. pylori polypeptide of SEQ ID NO: 492, 

15 such nucleic acid is contained in SEQ ID NO: 1. The H. pylori polypeptide sequences 
described herein are contained in the Sequence Listing, and the nucleic acids encoding 
H. pylori polypeptides are contained in the Sequence Listing. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 

20 polypeptides of SEQ ID NO: 492 through SEQ ID NO: 541. The invention also 

includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
the group consisting of H. pylori polypeptides SEQ ID NO: 492 through SEQ ID NO: 
541, such nucleic acids are contained in SEQ ID NO: 1 through SEQ ID NO: 50. 

In another aspect, the invention features a recombinant or substantially pure 

25 preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 542 through SEQ ID NO: 591 . The invention also 
includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
the group consisting of K pylori polypeptides SEQ ID NO: 542 through SEQ ID NO: 
591, such nucleic acids are contained in SEQ ID NO: 51 through SEQ ID NO: 100. 

30 In another aspect, the invention features a recombinant or substantially pure 

preparation of an H, pylori polypeptide selected from the group consisting of//, pylori 
polypeptides of SEQ ID NO: 592 through SEQ ID NO: 641. The invention also 
includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
the group consisting of K pylori polypeptides SEQ ID NO: 592 through SEQ ID NO: 

35 641, such nucleic acids are contained in SEQ ID NO: 101 through SEQ ID NO: 1 50. 
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In another aspect, the invention features a recombinant or substantially pure 
preparation of an K pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 642 through SEQ ID NO: 691 . The invention also 
includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
5 the group consisting of H. pylori polypeptides SEQ ID NO: 642 through SEQ ID NO: 
691, such nucleic acids are contained in SEQ ID NO: 151 through SEQ ID NO: 200. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an K pylori polypeptide selected from the group consisting of K pylori 
polypeptides of SEQ ID NO: 692 through SEQ ID NO: 741. The invention also 

10 includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
the group consisting of H. pylori polypeptides SEQ ID NO: 692 through SEQ ID NO: 
741, such nucleic acids are contained in SEQ ID NO: 201 through SEQ ID NO: 250. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 

1 5 polypeptides of SEQ ID NO: 742 through SEQ ID NO: 759, SEQ ID NO: 761 , SEQ ID 
NO. 763, SEQ ID NO: 765 through SEQ ID NO: 791 . The invention also includes 
substantially pure nucleic acid encoding an H, pylori polypeptide selected from the 
group consisting of//, pylori polypeptides SEQ ID NO: 742 through SEQ ID NO: 759, 
SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765 through SEQ ID NO: 791 , such 

20 nucleic acids are contained in SEQ ID NO: 251 through SEQ ID NO: 268, SEQ ID NO: 
270, SEQ ID NO: 272, and SEQ ID NO: 274 through SEQ ID NO: 300. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an //. pylori polypeptide selected from the group consisting of K pylori 
polypeptides of SEQ ID NO: 792 through SEQ ID NO: 818 and SEQ ID NO: 820 

25 through SEQ ID NO: 841. The invention also includes substantially pure nucleic acid 
encoding an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides SEQ ID NO: 792 through SEQ ID NO: 81 8 and SEQ ID NO: 820 through 
SEQ ID NO: 841, such nucleic acids are contained in SEQ ID NO: 301 through SEQ ID 
NO: 327 and SEQ ID NO: 329 throgh SEQ ID NO: 350. 

30 In another aspect, the invention features a recombinant or substantially pure 

preparation of an K pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 842 through SEQ ID NO: 846 and SEQ ID NO: 848 
through SEQ ID NO: 891. The invention also includes substantially pure nucleic acid 
encoding an K pylori polypeptide selected from the group consisting of K pylori 

35 polypeptides SEQ ID NO: 842 through SEQ ID NO: 846 and SEQ ID NO: 848 through 
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SEQ ID NO: 891, such nucleic acids are contained in SEQ ID NO: 351 through SEQ ID 
NO: 364 and SEQ ID NO: 366 through SEQ ID NO: 400. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of K pylori 
5 polypeptides of SEQ ID NO: 892 through SEQ ID NO: 896 and SEQ ID NO: 898 

through SEQ ID NO: 941 . The invention also includes substantially pure nucleic acid 
encoding an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides SEQ ID NO: 892 through SEQ ID NO: 896 and SEQ ID NO: 898 through 
SEQ ID NO: 941, such nucleic acids are contained in SEQ ID NO: 401 through SEQ ID 
10 NO: 405 and SEQ ID NO: 407 through SEQ ID NO: 450. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of//, pylori 
polypeptides of SEQ ID NO: 942 through SEQ ID NO: 963 and SEQ ID NO: 966 
through SEQ ID NO: 982. The invention also includes substantially pure nucleic acid 
1 5 encoding an K pylori polypeptide selected from the group consisting of H. pylori 

polypeptides SEQ ID NO: 942 through SEQ ID NO: 963 and SEQ ID NO: 966 through 
SEQ ID NO: 982, such nucleic acids are contained in SEQ ID NO: 451 through SEQ ID 
NO: 472 and SEQ ID NO: 475 through SEQ ID NO: 491. 

In another aspect, the invention features a recombinant or substantially pure 
20 preparation of an H. pylori polypeptide selected from the group consisting of K pylori 
polypeptides of SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041 through SEQ 
ID NO: 1087 and SEQ ID NO: 1090. The invention also includes substantially pure 
nucleic acid encoding an H. pylori polypeptide selected from the group consisting of//. 
pylori polypeptides SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041 through 
25 SEQ ID NO: 1087 and SEQ ID NO: 1090, such nucleic acids are contained in SEQ ID 
NO: 983, SEQ ID NO: 984, SEQ ID NO: 987 through SEQ ID NO: 1033 and SEQ ID 
NO: 1036. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an K pylori polypeptide selected from the group consisting of//, pylori 
30 polypeptides of SEQ ID NO: 1296 through SEQ ID NO: 1298. The invention also 

includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from 
the group consisting of//, pylori polypeptides SEQ ID NO: 1296 through SEQ ID NO: 
1298, such nucleic acids are contained in SEQ ID NO: 1293 through SEQ ID NO: 1295. 
In another aspect, the invention features a recombinant or substantially pure 
35 preparation of an H. pylori polypeptide selected from the group consisting of// pylori 
polypeptides as set forth in the Sequence Listing. The invention also includes 
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substantially pure nucleic acid encoding an H. pylori polypeptide selected from the 
group consisting of K pylori polypeptides as set forth in the Sequence Listing. It should 
be understood that this invention encompasses each of the H. pylori polypeptides and 
nucleic acids encoding such polypeptides as identified in the Sequence Listing by a 
5 given sequence identification number. For example, a representative H. pylori 

polypeptide is contained in SEQ ID NO: 494. Therefore, this invention encompasses a 
recombinant or substantially pure preparation of an H. pylori polypeptide of SEQ ID 
NO: 494. The invention also includes substantially pure nucleic acid encoding an K 
pylori polypeptide of SEQ ID NO: 494. 

10 In another aspect, the invention pertains to any individual H. pylori polypeptide 

member or nucleic acid encoding such member from the above-identified groups of H. 
pylori polypeptides (e.g., SEQ ID NO: 542-SEQ ID NO: 591) or nucleic acids (e.g., 
SEQ ID NO: 51 -SEQ ID NO: 100), as well as any subgroups from within the above- 
identified groups. Furthermore, the subgroups can preferably consists of 1, 3, 5, 10, 15, 

1 5 20, 30 or 40 members of any of the groups identified above, as well as, any 

combinations thereof. For example, the group consisting of H. pylori polypeptides SEQ 
ID NO: 692 through SEQ ID NO: 741 can be divided into one or more subgroups as 
follows: SEQ ID NO: 692-SEQ ID NO: 680; SEQ ID NO: 681-SEQ ID NO: 710; SEQ 
ID NO: 71 1-SEQ ID NO: 730; SEQ ID NO: 731-SEQ ID NO: 741 ; or any combinations 

20 thereof. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H. pylori cell envelope polypeptide or a fragment thereof. Such 
nucleic acid is selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 
263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID 

25 NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 31 1, SEQ ID NO: 312, SEQ 
ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, 
SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 
375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID 
NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 41 0, SEQ 

30 ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, 
SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450 ? SEQ ID NO: 
452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID 
NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 101 1, SEQ ID NO: 1014, 
SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID 

35 NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ 
ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 
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1023. SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ 
ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, 
SEQ ID NO: 467. SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID 
NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381. 
5 SEQ ID NO: 389, SEQ ID NO: 1010, SEQ ID NO: 1012, SEQ ID NO: 354, SEQ ID 
NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, 
SEQ ID NO: 28 1 , SEQ ID NO: 988, SEQ ID NO: 4 1 1 , SEQ ID NO: 407, SEQ ID NO: 
1017, SEQ ID NO: 290, SEQ ID NO: 4 1 7, SEQ ID NO: 430, SEQ ID NO: 992, SEQ ID 
NO: 1025, SEQ ID NO: 477, SEQ ID NO: 414, SEQ ID NO: 253, SEQ ID NO: 293, 
1 0 SEQ ID NO: 334, SEQ ID NO: 343, SEQ ID NO: 4 1 8, SEQ ID NO: 424, and SEQ ID 
NO: 443. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by the 
nucleic acid selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, 

1 5 SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 
292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID 
NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ 
ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, 
SEQ ID NO: 384, SEQ ID NO: 391 , SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 

20 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 4 1 0, SEQ ID 
NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ 
ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, 
SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 
983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 101 1, SEQ ID NO: 1014, SEQ 

25 ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 
286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID 
NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, 
SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID 
NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ 

30 ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 

254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID 
NO: 389, SEQ ID NO: 1010, and SEQ ID NO: 1012. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a 

35 fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ 
ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, 
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In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment there- 
of encoded by the nucleic acid comprising a nucleotide sequence of SEQ ID NO: 463. 

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
5 thereof is an K pylori polypeptide involved in secretion or adhesion or a fragment 

thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 
281 and SEQ ID NO: 988. 

In another embodiment, the K pylori inner membrane polypeptide or a fragment 
thereof is an K pylori polypeptide involved in transport or a fragment thereof encoded 
10 by the nucleic acid selected from the group consisting of SEQ ID NO: 407 and SEQ ID 
NO: 1017. 

In another embodiment, the K pylori cell envelope polypeptide or a fragment 
thereof is an K pylori flagellar polypeptide or a fragment thereof encoded by the nucleic 
acid comprising a nucleotide sequence of SEQ ID NO: 477. 

15 In another embodiment, the H. pylori cell envelope polypeptide or a fragment 

thereof is an H. pylori transport polypeptide or a fragment thereof encoded by the 
nucleic acid comprising a nucleotide sequence of SEQ ID NO: 414. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H. pylori cytoplasmic polypeptide or a fragment thereof. Such 

20 nucleic acid is selected from the group consisting of SEQ ID NO: 470, SEQ ID NO: 

1033, SEQ ID NO: 357, SEQ ID NO: 457, SEQ ID NO: 461, SEQ ID NO: 1030, SEQ 
ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, SEQ ID NO: 1003, 
SEQ ID NO: 351, SEQ ID NO: 416, SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 
346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID 

25 NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, SEQ ID NO: 1293, SEQ ID NO: 318, 
SEQ ID NO: 322, SEQ ID NO: 324, SEQ ID NO: 330, SEQ ID NO: 347, SEQ ID NO: 
440, SEQ ID NO: 446, SEQ ID NO: 464, SEQ ID NO: 490, SEQ ID NO: 49 1 , SEQ ID 
NO: 995, SEQ ID NO: 997, SEQ ID NO: 1005, and SEQ ID NO: 1028. 

In another embodiment, the K pylori cytoplasmic polypeptide or a fragment 

30 thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 470 and 
SEQ ID NO: 1033. 

In another embodiment, the K pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a 
35 fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ 
ID NO: 357 and SEQ ID NO: 457. 
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In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in nucleotide metabolism and transport or a 
fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ 
ID NO: 461 and SEQ ID NO: 1030. 
5 In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 

thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment 
thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 
345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, and SEQ ID NO: 1003. 
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
1 0 thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 35 1 and 
SEQ ID NO: 416. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in genome replication, transcription, 

1 5 recombination and repair or a fragment thereof encoded by the nucleic acid selected 
from the group consisting of SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, 
SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 
1000, SEQ ID NO: 1004, SEQ ID NO: 1020, and SEQ ID NO: 1293. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 

20 sequence encoding an H. pylori secreted polypeptide or a fragment thereof. Such 
nucleic acid is selected from the group consisting of SEQ ID NO: 355, SEQ ID NO: 
1006, SEQ ID NO: 257, SEQ ID NO: 258, SEQ ID NO: 260, SEQ ID NO: 261, SEQ ID 
NO: 264, SEQ ID NO: 265, SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ 
ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 279, SEQ ID NO: 283, 

25 SEQ ID NO: 284, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 
291, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID 
NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ 
ID NO: 305, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 323, SEQ ID NO: 338, 
SEQ ID NO: 342, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 356, SEQ ID NO: 

30 358, SEQ ID NO: 359, SEQ ID NO: 360, SEQ ID NO: 361, SEQ ID NO: 362, SEQ ID 
NO: 363, SEQ ID NO: 367, SEQ ID NO: 370, SEQ ID NO: 371, SEQ ID NO: 373, SEQ 
ID NO: 377, SEQ ID NO: 378, SEQ ID NO: 379, SEQ ID NO: 380, SEQ ID NO: 388, 
SEQ ID NO: 390, SEQ ID NO: 394, SEQ ID NO: 395, SEQ ID NO: 396, SEQ ID NO: 
401, SEQ ID NO: 403, SEQ ID NO: 405, SEQ ID NO: 408, SEQ ID NO: 420, SEQ ID 

35 NO: 426, SEQ ID NO: 428, SEQ ID NO: 429, SEQ ID NO: 432, SEQ ID NO: 439, SEQ 
ID NO: 442, SEQ ID NO: 451, SEQ ID NO: 471, SEQ ID NO: 478, SEQ ID NO: 488, 
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SEQ ID NO: 987, SEQ ID NO: 990, SEQ ID NO: 991, SEQ ID NO: 993, SEQ ID NO: 
1001, SEQIDNO: 1002, SEQ ID NO: 1007, SEQ ID NO: 1013, SEQ ID NO: 1016, 
SEQ ID NO: 1018, SEQ ID NO: 1021, and SEQ ID NO. 1026. 

In another embodiment, the H. pylori secreted polypeptide or a fragment thereof 
5 is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 355 and 
SEQ ID NO: 1006. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide 
sequence encoding an H. pylori cellular polypeptide or a fragment thereof. Such nucleic 

1 0 acid is selected from the group consisting of SEQ ID NO: 256, SEQ ID NO: 267, SEQ 
ID NO: 282, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, 
SEQ ID NO. 310, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 332, SEQ ID NO: 
333. SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID NO: 340. SEQ ID 
NO: 341, SEQ ID NO: 344, SEQ ID NO: 369, SEQ ID NO: 376, SEQ ID NO: 382, SEQ 

1 5 ID NO: 386, SEQ ID NO: 423, SEQ ID NO: 431, SEQ ID NO: 435, SEQ ID NO: 436, 
SEQ ID NO: 458, SEQ ID NO: 462, SEQ ID NO: 475, SEQ ID NO: 476, SEQ ID NO: 
479, SEQ ID NO: 480, SEQ ID NO: 481, SEQ ID NO: 482, SEQ ID NO: 483, SEQ ID 
NO: 484, SEQ ID NO: 485, SEQ ID NO: 486, SEQ ID NO: 487, SEQ ID NO: 489, SEQ 
ID NO: 984, SEQ ID NO: 994, SEQ ID NO: 1024, and SEQ ID NO: 1036. 

20 Particularly preferred is a purified or isolated H. pylori cell envelope polypeptide 

or a fragment thereof, wherein the polypeptide is selected from the group consisting of 
SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 
771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID 
NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ 

25 ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, 
SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 
883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID 
NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ 
ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, 

30 SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 
959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ 
ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 
1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID 
NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, 

35 SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID 
NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ 
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ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 
1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906. SEQ 
ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, SEQ ID NO: 
1066, SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID 
5 NO: 1 076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1 042, SEQ ID NO: 902, 
SEQ ID NO: 898, SEQ ID NO: 1 07 1 , SEQ ID NO: 78 1 , SEQ ID NO: 908, SEQ ID NO: 
921, SEQ ID NO: 1046, SEQ ID NO: 1079, SEQ ID NO: 968, SEQ ID NO: 905, SEQ 
ID NO: 744, SEQ ID NO: 784, SEQ ID NO: 825, SEQ ID NO: 834, SEQ ID NO: 909, 
SEQ ID NO: 915, and SEQ ID NO: 934. 
1 0 In another embodiment, the H. pylori cell envelope polypeptide or a fragment 

thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from 
the group consisting of SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID 
NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ 
ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, 
15 SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 
855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID 
NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ 
ID NO. 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, 
SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 
20 936, SEQ ID NO: 940, SEQ ID NO: 941 , SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID 
NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, 
SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID 
NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, 
SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 
25 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ 
ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, 
SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 
1050, SEQ ID NO: 1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ 
ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 
30 1064, and SEQ ID NO: 1066. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a 
fragment thereof selected from the group consisting of SEQ ID NO: 746, SEQ ID NO: 
754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID 
35 NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ 
ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, 
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SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 
866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID 
NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ 
ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, 
5 SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 
943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID 
NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 
1068, SEQ ID NO: 1069, SEQ ID NO: 1083, and SEQ ID NO: 1086. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
10 thereof is an H. pylori polypeptide having a C-terminal tyrosine cluster or a fragment 
thereof selected from the group consisting of SEQ ID NO: 777, SEQ ID NO: 817, SEQ 
ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, 
SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, and SEQ 
ID NO: 1298. 

15 In another embodiment, the H. pylori outer membrane polypeptide or a fragment 

thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C- 
terminal tyrosine cluster or a fragment thereof selected from the group consisting of 
SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 
929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID 

20 NO: 1 08 1 , and SEQ ID NO: 1 085. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof selected from 
the group consisting of SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891 , SEQ ID 
NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, 

25 SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 
908, SEQ ID NO: 921, SEQ ID NO: 1046, and SEQ ID NO: 1079. 

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in outer membrane and cell wall synthesis 
or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 845. 

30 In another embodiment, the H. pylori inner membrane polypeptide or a fragment 

thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof 
selected from the group consisting of SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 
912, and SEQ ID NO: 1076. 

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 

35 thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment 
thereof comprising an amino acid sequence of SEQ ID NO: 954. 
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In another embodiment, the K pylori inner membrane polypeptide or a fragment 
thereof is an K pylori polypeptide involved in secretion or adhesion or a fragment 
thereof selected from the group consisting of SEQ ID NO: 772 and SEQ ID NO: 1042. 

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
5 thereof is an H. pylori polypeptide involved in transport or a fragment thereof selected 
from the group consisting of SEQ ID NO: 898 and SEQ ID NO: 1071. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori flagellar polypeptide or a fragment thereof comprising an amino 
acid sequence of SEQ ID NO: 968. 
10 In another embodiment, the //. pylori cell envelope polypeptide or a fragment 

thereof is an K pylori transport polypeptide or a fragment thereof comprising an amino 
acid sequence of SEQ ID NO: 905. 

Particularly preferred is a purified or isolated H. pylori cytoplasmic polypeptide 
or a fragment thereof, wherein the polypeptide is selected from the group consisting of 
15 SEQ ID NO: 961, SEQ ID NO: 1087, SEQ ID NO: 848, SEQ ID NO: 948, SEQ ID NO: 
952, SEQ ID NO: 1084, SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID 
NO: 946, SEQ ID NO: 1057, SEQ ID NO: 842, SEQ ID NO: 907, SEQ ID NO: 769, 
SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 
951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, SEQ 
20 ID NO: 1296, SEQ ID NO: 809, SEQ ID NO: 813, SEQ ID NO: 815, SEQ ID NO: 821, 
SEQ ID NO: 838, SEQ ID NO: 93 1 , SEQ ID NO: 937, SEQ ID NO: 955, SEQ ID NO: 
981, SEQ ID NO: 982, SEQ ID NO: 1049, SEQ ID NO: 1051, SEQ ID NO: 1059, and 
SEQ ID NO: 1082. 

In another embodiment, the K pylori cytoplasmic polypeptide or a fragment 
25 thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof 
selected from the group consisting of SEQ ID NO: 961 and SEQ ID NO: 1087. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a 
fragment thereof selected from the group consisting of SEQ ID NO: 848 and SEQ ID 
30 NO: 948. 

In another embodiment, the K pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in nucleotide metabolism and transport or a 
fragment thereof selected from the group consisting of SEQ ID NO: 952 and SEQ ID 
NO: 1084. 

35 In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 

thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment 
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thereof selected from the group consisting of SEQ ID NO: 836, SEQ ID NO: 874, SEQ 
ID NO: 878, SEQ ID NO: 946, and SEQ ID NO: 1057. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof 

5 selected from the group consisting of SEQ ID NO: 842, SEQ ID NO: 907. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide involved in genome replication, transcription, 
recombination and repair or a fragment thereof selected from the group consisting of 
SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 

10 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ 
ID NO: 1074, and SEQ ID NO: 1296. 

Particularly preferred is a purified or isolated H. pylori secreted polypeptide or a 
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 846, SEQ ID NO: 1060, SEQ ID NO: 748, SEQ ID NO: 749, SEQ ID NO: 75 1 , 

1 5 SEQ ID NO: 752, SEQ ID NO: 755, SEQ ID NO: 756, SEQ ID NO: 759, SEQ ID NO: 
761, SEQ ID NO: 763, SEQ ID NO: 765, SEQ ID NO: 766, SEQ ID NO: 767, SEQ ID 
NO: 770, SEQ ID NO. 774, SEQ ID NO: 775, SEQ ID NO: 778, SEQ ID NO: 779, SEQ 
ID NO: 780, SEQ ID NO: 782, SEQ ID NO: 786, SEQ ID NO: 787, SEQ ID NO: 788, 
SEQ ID NO: 789, SEQ ID NO: 791, SEQ ID NO: 792, SEQ ID NO: 793, SEQ ID NO: 

20 794, SEQ ID NO: 795, SEQ ID NO: 796, SEQ ID NO: 805, SEQ ID NO: 806, SEQ ID 
NO: 814, SEQ ID NO: 829, SEQ ID NO: 833, SEQ ID NO: 839, SEQ ID NO: 840, SEQ 
ID NO: 847, SEQ ID NO: 849, SEQ ID NO: 850, SEQ ID NO: 851, SEQ ID NO. 852, 
SEQ ID NO: 853, SEQ ID NO: 854, SEQ ID NO: 858, SEQ ID NO: 861, SEQ ID NO: 
862, SEQ ID NO: 864, SEQ ID NO: 868, SEQ ID NO: 869, SEQ ID NO: 870, SEQ ID 

25 NO: 871, SEQ ID NO: 879, SEQ ID NO: 881, SEQ ID NO: 885, SEQ ID NO: 886, SEQ 
ID NO: 887, SEQ ID NO: 892, SEQ ID NO: 894, SEQ ID NO: 896, SEQ ID NO: 899, 
SEQ ID NO: 911, SEQ ID NO: 917, SEQ ID NO: 919, SEQ ID NO: 920, SEQ ID NO: 
923, SEQ ID NO: 930, SEQ ID NO: 933, SEQ ID NO: 942, SEQ ID NO: 962, SEQ ID 
NO: 969, SEQ ID NO: 979, SEQ ID NO: 1041, SEQ ID NO: 1044, SEQ ID NO: 1045, 

30 SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: 1056, SEQ ID NO: 1061, SEQ ID 
NO: 1067, SEQ ID NO: 1070, SEQ ID NO: 1072, SEQ ID NO: 1075, and SEQ ID NO: 
1080. 

In another embodiment, the H. pylori secreted polypeptide or a fragment thereof 
is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof 
35 selected from the group consisting of SEQ ID NO: 846 and SEQ ID NO: 1 060. 



bnsdociD' <wo 07',7rv^fli n > 



BNS page 17 



WO 97/37044 




PCT/US97/05223 



- 16- 



Particularly preferred is a purified or isolated H. pylori cellular polypeptide or a 
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 747, SEQ ID NO: 758, SEQ ID NO: 773, SEQ ID NO: 797, SEQ ID NO: 798, 
SEQ ID NO: 799, SEQ ID NO: 800, SEQ ID NO: 801, SEQ ID NO: 807, SEQ ID NO: 
5 808, SEQ ID NO: 823, SEQ ID NO: 824, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID 
NO: 830, SEQ ID NO: 831, SEQ ID NO: 832, SEQ ID NO: 835, SEQ ID NO: 860, SEQ 
ID NO: 867, SEQ ID NO: 873, SEQ ID NO: 877, SEQ ID NO: 914, SEQ ID NO: 922, 
SEQ ID NO: 926, SEQ ID NO: 927, SEQ ID NO: 949, SEQ ID NO: 953, SEQ ID NO: 
966, SEQ ID NO: 967, SEQ ID NO: 970, SEQ ID NO: 971, SEQ ID NO: 972, SEQ ID 
10 NO: 973, SEQ ID NO: 974, SEQ ID NO: 975, SEQ ID NO: 976, SEQ ID NO: 977, SEQ 
ID NO: 978, SEQ ID NO: 980, SEQ ID NO: 1038, SEQ ID NO: 1048, SEQ ID NO: 
1078, and SEQ ID NO: 1090. 

In another aspect, the invention pertains to any individual H. pylori polypeptide 
member or nucleic acid encoding such a member from the above-identified groups of H. 
1 5 pylori polypeptides. 

In another aspect, the invention features nucleic acids capable of binding mRN A 
of H, pylori. Such nucleic acid is capable of acting as antisense nucleic acid to control 
the translation of mRNA of H. pylori. A further aspect features a nucleic acid which is 
capable of binding specifically to an K pylori nucleic acid. These nucleic acids are also 
20 referred to herein as complements and have utility as probes and as capture reagents. 

In another aspect, the invention features an expression system comprising an 
open reading frame corresponding to H. pylori nucleic acid. The nucleic acid further 
comprises a control sequence compatible with an intended host. The expression system 
is useful for making polypeptides corresponding to H. pylori nucleic acid. 
25 In another aspect, the invention features a cell transformed with the expression 

system to produce H. pylori polypeptides. 

In another aspect, the invention features a method of generating antibodies 
against K pylori polypeptides which are capable of binding specifically to H. pylori 
polypeptides. Such antibody has utility as reagents for immunoassays to evaluate the 
30 abundance and distribution of H. /?y/or/-specific antigens. 

In another aspect, the invention features a method of generating vaccines for 
immunizing an individual against H. pylori. The method includes: immunizing a 
subject with an H. pylori polypeptide, e.g., a surface or secreted polypeptide, or active 
portion thereof, and a pharmaceutical ly acceptable carrier. Such vaccines have 
35 therapeutic and prophylactic utilities. 
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In another aspect, the invention provides a method for generating a vaccine 
comprising a modified immunogenic H. pylori polypeptide, e.g., a surface or secreted 
polypeptide, or active portion thereof, and a pharmacologically acceptable carrier. 

In another aspect, the invention features a method of evaluating a compound, e.g. 
5 a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an // 
pylori polypeptide. The method includes: contacting the candidate compound with an 
H. pylori polypeptides and determining if the compound binds or otherwise interacts 
with an H. pylori polypeptide. Compounds which bind H. pylori are candidates as 
activators or inhibitors of the bacterial life cycle. These assays can be performed in vitro 
10 or in vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g. 

a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. 

pylori nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate 

compound with an H. pylori nucleic acid and determining if the compound binds or 
1 5 otherwise interacts with an H. pylori polypeptide. Compounds which bind K pylori are 

candidates as activators or inhibitors of the bacterial life cycle. These assays can be 

performed in vitro or in vivo. 

The invention features K pylori polypeptides, preferably a substantially pure 

preparation of an H. pylori polypeptide, or a recombinant K pylori polypeptide. In 
20 preferred embodiments: the polypeptide has biological activity; the polypeptide has an 

amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous to 

an amino acid sequence contained in the Sequence Listing; the polypeptide has an amino 

acid sequence essentially the same as an amino acid sequence in the Sequence Listing; 

the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acid residues in length; the 
25 polypeptide includes at least 5, preferably at least 10, more preferably at least 20, more 

preferably at least 50, 100, or 150 contiguous amino acid residues of a polypeptide 

contained in the Sequence Listing. 

In preferred embodiments: the H. pylori polypeptide is encoded by a nucleic 

acid contained in the Sequence Listing, or by a nucleic acid having at least 60%, 70%, 
30 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid shown in the Sequence 

Listing. 

In a preferred embodiment, the subject H. pylori polypeptide differs in amino 
acid sequence at 1, 2, 3, 5, 10 or more residues from a sequence in the Sequence Listing. 
The differences, however, are such that the K pylori polypeptide exhibits an K pylori 
35 biological activity, e.g., the K pylori polypeptide retains a biological activity of a 
naturally occurring H. pylori enzyme. 
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in preferred embodiments, the polypeptide includes all or a fragment of an amino 
acid sequence contained in the Sequence Listing; fused, in reading frame, to additional 
amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic 
DNA which encodes a sequence contained in the Sequence Listing. 
5 In yet other preferred embodiments, the H. pylori polypeptide is a recombinant 

fusion protein having a first H. pylori polypeptide portion and a second polypeptide 
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to 
H. pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase ? 
a DNA binding domain, or a polymerase activating domain. In preferred embodiment 
10 the fusion protein can be used in a two-hybrid assay. 

Polypeptides of the invention include those which arise as a result of alternative 
transcription events, alternative RNA splicing events, and alternative translational and 
postradiational events. 

The invention also encompasses an immunogenic component which includes an 
15 H. pylori polypeptide in an immunogenic preparation; the immunogenic component 
being capable of eliciting an immune response specific for the H. pylori polypeptide, 
e.g., a humoral response, an antibody response, or a cellular response. In preferred 
embodiments, the immunogenic component comprises at least one antigenic determinant 
from a polypeptide contained in the Sequence Listing. 
20 In another aspect, the invention provides a substantially pure nucleic acid having 

a nucleotide sequence which encodes an H. pylori polypeptide. In preferred 
embodiments: the encoded polypeptide has biological activity; the encoded polypeptide 
has an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% 
homologous to an amino acid sequence contained in the Sequence Listing; the encoded 
25 polypeptide has an amino acid sequence essentially the same as an amino acid sequence 
in the Sequence Listing; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 
amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 
10, more preferably at least 20, more preferably at least 50, 100, or 1 50 contiguous 
amino acids contained in the Sequence Listing. 
30 In preferred embodiments: the nucleic acid is that as shown in the Sequence 

Listing; the nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% 
homologous with a nucleic acid sequence contained in the Sequence Listing. 

In a preferred embodiment, the encoded H. pylori polypeptide differs (e.g., by 
amino acid substitution, addition or deletion of at least one amino acid residue) in amino 
35 acid sequence at 1, 2, 3, 5, 10 or more residues, from a sequence in the Sequence 

Listing. The differences, however, are such that: the K pylori encoded polypeptide 
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exhibits a H. pylori biological activity, e.g., the encoded H. pylori enzyme retains a 
biological activity of a naturally occurring H. pylori. 

In preferred embodiments, the encoded polypeptide includes all or a fragment of 
an amino acid sequence contained in the Sequence Listing; fused, in reading frame, to 
5 additional amino acid residues, preferably to residues encoded by genomic DNA 5' to 
the genomic DNA which encodes a sequence contained in the Sequence Listing. 

In preferred embodiments, the subject H. pylori nucleic acid will include a 
transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or 
transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g., 
10 to render the H. pylori gene sequence suitable for expression in a recombinant host cell. 

In yet a further preferred embodiment, the nucleic acid which encodes an H. 
pylori polypeptide of the invention, hybridizes under stringent conditions to a nucleic 
acid probe corresponding to at least 8 consecutive nucleotides of a nucleic acid 
contained in the Sequence Listing; more preferably to at least 12 consecutive nucleotides 
15 of a nucleic acid contained in the Sequence Listing; more preferably to at least 20 
consecutive nucleotides of a nucleic acid contained in the Sequence Listing; more 
preferably to at least 40 consecutive nucleotides of a nucleic acid contained in the 
Sequence Listing. 

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at 
20 least one amino acid residue from the sequences shown in the Sequence Listing. 

In a preferred embodiment, the nucleic acid differs by at least one nucleotide 
from a nucleotide sequence shown in the Sequence Listing which encodes amino acids 
shown in the Sequence Listing. 

In another aspect, the invention encompasses: a vector including a nucleic acid 
25 which encodes an H. pylori polypeptide or an H. pylori polypeptide variant as described 
herein; a host cell transfected with the vector; and a method of producing a recombinant 
H. pylori polypeptide or H. pylori polypeptide variant; including culturing the cell, e.g., 
in a cell culture medium, and isolating the H. pylori or H. pylori polypeptide variant 
e.g., from the cell or from the cell culture medium. 
30 In another aspect, the invention features, a purified recombinant nucleic acid 

having at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a 
sequence contained in the Sequence Listing. 

The invention also provides a probe or primer which includes a substantially 
purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence 
35 which hybridizes under stringent conditions to at least 10 consecutive nucleotides of 
sense or antisense sequence contained in the Sequence Listing, or naturally occurring 
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mutants thereof. In preferred embodiments, the probe or primer further includes a label 
group attached thereto. The label group can be, e.g., a radioisotope, a fluorescent 
compound, an enzyme, and/or an enzyme co-factor. Preferably, the oligonucleotide is at 
least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length. 
5 The invention further provides nucleic acids, e.g., RNA or DNA, encoding a 

polypeptide of the invention. This includes double stranded nucleic acids as well as 
coding and antisense single strands. 

The H. pylori strain, from which genomic sequences have been sequenced, has 
been deposited in the American Type Culture Collection(ATCC) as strain HP-J99. 

10 Included in the invention are: allelic variations; natural mutants; induced 

mutants; proteins encoded by DNA that hybridizes under high or low stringency 
conditions to a nucleic acid which encodes a polypeptide as shown in the Sequence 
Listing (for definitions of high and low stringency see Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by 

1 5 reference); and, polypeptides specifically bound by antisera to K pylori polypeptides, 
especially by antisera to an active site or binding domain of H. pylori polypeptide. The 
invention also includes fragments, preferably biologically active fragments. These and 
other polypeptides are also referred to herein as K pylori polypeptide analogs or 
variants. 

20 Putative functions have been determined for several of the H. pylori polypeptides 

of the invention, as shown in Table 1 . 

Accordingly, uses of the claimed H. pylori polypeptides in these identified 

functions are also within the scope of the invention. 

In addition, the present invention encompasses H, pylori polypeptides 
25 characterized as shown in Table 1 below, including: H. pylori cell envelope proteins, //. 

pylori secreted proteins, H. pylori cytoplasmic proteins and H. pylori cellular proteins. 

Members of these groups were identified by BLAST homology searches and by searches 

for secretion signal or transmembrane protein motifs. Polypeptides related by 

significant homology to the polypeptides of Table 1 (as reflected by FASTA compar- 
30 isons of the amino acid sequences and indicated in many cases in Tables 3-6 below) are 

also considered to be classified in the manner of the homolog shown in Table 1 . 



TABLE 1 



ORF NAME 


nt Seq ID# 


aa Seq ID # 


A.CELL ENVELOPE 






A. 1 Outer membrane 






A. 1 .2 Terminal phe residue 






0lael200I_242I8781 J2J8 


255 


746 
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02gel 1622_875260_f3_36 


263 


754 


02gp20706_23632775_O_32 


266 


757 


07apll0l5_239383l2J3_2 


271 


762 


05eel0816 4103408 f2 11 


277 


768 


06ap20306_23437632J3_9 


280 


771 


07ap202 1 6_7227202_f3_ 1 0 


285 


776 


1 4ap 1 08 1 5__20585777_c 1 _1 3 


292 


783 


hp lp 1401 3 J !726503_c2_20 


294 


785 


hp5p 1 564 1 _2 1 698387_c2_20 


299 


790 


02gp20706_ 1 68035 1 3_f 1 _ I 


311 


802 


02gp20706_20365905_f2_8 


312 


803 


02gp208 1 4_3 9848 1 8_f 1 _ 1 


313 


804 


05ae30220 9882767 f2 34 


321 


812 


07epl 1 916^591 3592 J3_l 8 


327 


818 


09zel0333 22460750 f2 6 


329 


820 


1 4ap 1 022 1 _ 1 36893 8 1 _c3_4 


331 


882 


06ce206 1 0_ 1 367 1 57_fl _8.aa 


353 


844 


02ge 1 0 11 6_ 1 68035 1 3 J2_34 


364 


855 


02ge 1011 6_36367936_c 1 ,92 


366 


857 


05cel0910 25598277 f3 3 


368 


859 


06cpl 1217_4881263J2_9 


375 


866 


06ep30223_34409437_f3_94 


384 


875 


06gp7 1 906_970325_c3_l 90 


391 


882 


07ae 10923 24426508 fl 1 


392 


883 


09cp 1 0224_ 1 062966_c3_6 1 


397 


888 


09cp 1 0224_ 141271 5_c3_56 


398 


889 


09cp6 1 003_ 1 456263 7_c2_93 


402 


893 


09cp6 1 003_24063587_c 1 _74 


404 


895 


llae80818 7952 cl 49 


409 


900 


1 1 ap207 1 4_4960432_c3_97 


410 


901 


1 1 ap207 1 4_7227202_f3_43.aa 


412 


903 


hp5p 1 564 1 _ 1 2 1 9528 1 __c 1 _24 


427 


918 


hp6p 1 0903_4398263J3_6.aa 


433 


924 


hp6p!0903_4398263J3_6.aa 


434 


925 


02ae3 1 0 1 0_302083 1 7_f 1_ 14 


441 


932 


02cp 1 06 1 5_26573462_c 1 _45 


444 


935 


03ael0804 12609533 cl 26 


445 


936 


04ep4 1 903 _4 1 0 1 593J2_ 1 0.aa 


449 


940 


05ep 1 08 1 5_26570332_c2_99 


450 


941 


05ep 1 08 1 5_47 1 9 1 75_c I _83 


452 


943 


06cp30603_6792 1 8J234 


453 


944 


llae80818 19632781 c3 57 


466 


957 


1 1 ap207 1 4^340233 1 2_f3_46 


468 


959 


Ileell408 4977193 cl 41.aa 


469 


960 


llae 11922 12586675 f2 1 


983 


1037 


07ee 11402 2458267 c3 108 


985 


1039 


1 lap20714_7227202_£3_40 


989 


1043 


hp7e 1 0 1 92_25598277 - c2 - 1 5 


1008 


1062 


06ep30223_3440943 7_f2_64 


1011 


1065 


09cp 1 0224_ 1 062966__c 1 _44 


1014 


1068 


01ce61016 12931513 c2 106 


1015 


1069 


05epl0815_4719l75__cl_l 15 


1029 


1083 
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05ae30220 4977193 c3 198 


1 017 




A. 1.3 No terminal phe residue 






02ae 11612 22477267 f2 27 


750 

X J7 




A. 1 .4 C-terminal tyrosine cluster motif 






07epl I916_5273452 c3 31 


7RA 
XoO 


TIT 

777 


07ce 11019 22051291 fl 1 


17A 


0 1 "7 

6 I 7 


06cp 11217 19720300 f3 1J 




ox c 

865 


09cd 10^24 429510 c2 46 aa 


inn 


890 


ho4e53394 22864682 c2 86 aa 


4xx 


913 


06eol0615 96156*> f2 41 




945 


09cd6 1 003 492 1 87 c2 80 aa 


/IXC 


956 


14eo 11423 26803801 fl 7 


998 


1052 


hr>4e53394 1972010ft rl OR 


1009 


1063 


ho4e533Q4 10770100 rl OR 


1023 


1077 


01ce6IOl6 407 1R7 rl 170 


1294 


1297 


06en 1 06 1 S 06 1 567 f 1 1 S 


1295 


1298 


.j i cjiuinai pne resiaue ana i„ -terminal tyrosine cluster 






04?el0816 117760R0 r7 70 


319 


810 


06ee 10709 7167S017 fl 7 


^25 


816 


hn5n1557S 11445117 D 70 na 


425 


916 


01cd20708 16114R0R f7 11 


437 


928 


02ael1010 12S04517 fl 7R aa 


438 


929 


04eo41903 26757937 fl 16 


447 


938 


04eo4l903 26757937 fl 16 




939 


llaeROSIR 7700677 r7 Si 


467 


958 


13ael0610 15017 f7 1 

1 JOblUUIV JJ71Z IX J 


996 


1050 


01cd20708 110R6007 fl 77 


1027 


1081 


14cnl 1908 2471R9S4 rl 6R 


J 03 1 


1085 


A 1 6 Via hnTTiAlov 
' i * «-* » ia iiuuimcy 






OlaellOlO 40688 c2 18 


x^4 


745 


02&D208I4 24415958 fl 9 




760 


09cd 11003 5945252 f2 4 




o i n 

o 19 


06cd30603 ^3476568 c2 131 aa 




O A 

84.? 


09cp6 1 003 5945252_f 1 _5 


406 


897 


I4ee41924 71S77767 rl 107 


413 


906 


07ee 11402 10750567 r7 R6 


luiv 


1073 


A 1 7 Other outer mpmhnnp nrnt^inc 






06eol0615 9842 fl 46 


iq i 
JO I 


872 


06en 10409 119R427 f? 17 




o orv 

880 


06eol0615 9842 fl 5 


1 A1 A 

JU 1 u 


1064 


06pn 10400 11QR477 f7 17 


1012 


1066 


AV.X illllCI IVJClilDlCUlC 






A.2.1 Proteins involved in outer membrane and cell wall 
synthesis 






06ep30223_4698838_f2_55 


354 


845 


A. 2. 2 Proteins involved in energy conversion 






06ce20610 4331338 D 18 


372 


863 


09cp 10224_44847 1 8_c 1_38 


400 


891 


hp4e I 3394_5964452_c297 


421 


912 


hp4e!3394_ 15828963 c2_90 


1022 


1076 


A.2.3 Proteins involved in cofactor metabolism 






06gp7 1 906_25478 1 92_c 1 _ 1 3 1 


463 


954 



WO 97/37044 



PCT/US97/05223 



-23 - 



A. 2. 4 Proteins involved in secretion and adhesion 






06cp30603__23452_c3_80 


281 


772 


09cp 1 07 1 3_23452_c3_ 1 95 


988 


1042 


A. 2. 5 Via homolgy 






1 Iap207l4_527l967_cl_60 


41 1 


902 


A. 2. 6 Proteins involved in transport 






Hae808I8 11188791 c3 60 


407 


898 


14cpl 1908J25593768_c3_97 


1017 


1071 


A.2.7 Other inner membrane proteins 






13ael0712 14100018 f2 12 


290 


781 


hplp!3939_25397327_0_22 


417 


908 


hp5p 1 5870_ 1 4350428_f 1 _ 1 


430 


921 


06ge2050 1 _ 1 4 1 000 1 8_c 1 _34 


992 


1046 


05ae3 0220_ 1 43 50428 J3_9 1 


1025 


1079 


A.3 Flagellar proteins 






hp4el 3394_3368767_c l_80 


477 


968 


A. 4 Transport proteins 






14ce31519 15635927 f3 15 


414 


905 


A,5 Other cell envelope proteins 






04cpl 1 202_24256567 - c3_l 1 7 


253 


744 


29ge3032 1 _34 1 578 1 2_f3_ 1 0 


293 


784 


29ge3 032 1 _ 1 29 1 3 562_f I _ 1 


334 


825 


hp6p 1 0233_ 1 2273 302_f 1 _ 1 


343 


834 


hp2el091 l_24855312_cl_69 


418 


909 


hp5pl5575_2930O31 l_cl_29 


424 


915 


02ae3 1 0 1 0_5085 1 62_c 1 _47 


443 


934 


B. CYTOPLASMIC PROTEINS 






B. 1 Proteins involved in energy conversion 






1 lgel0308_5256J2J 


470 


961 


1 lgel0308_24609417J2_l 


1033 


1087 


B.2 Proteins involved in amino acid metabolism and transport 






06ep 1191 7_24803 1 53_c3„24 


357 


848 


06epl 1202_4884677_cl_17 


457 


948 


B.3 Proteins involved in nucleotide metabolism and transport 






06ep30223_23476067_c I_l 1 9 


461 


952 


06ep30223_23476067_cl_l 15 


1030 


1084 


B.4 Proteins involved in cofactor metabolism 






07apl 1213_35156577_cl_24 


345 


836 


06ep30223_23 557202_c2_ 1 30 


383 


874 


06ep30223_5 109443_cl_109 


387 


878 


06epll202_133293_cl_19 


455 


946 


07ee50709_35 1 56577_G_80 


1003 


1057 


B.5 Proteins involved in lipid metabolism 






hp6e 1 2267_1 4650278 J3_29 


351 


842 


I4ee41924 23834800 f2 32 


416 


907 


B.6 Proteins involved in mRNA translation and ribosome 
biogenesis 






I4ee41924 16282067 cl 72 


473 


964 


07eell402 19565702 c2 88 


1034 


1088 


" B.7 Proteins involved in genome replicati n, transcription, 
recombination and repair" 






05eel08l6 4687651 cl 22 


278 


769 
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29ge3032 1 _ 1 35253 J2_6 


335 


826 


07ap8060 1_9764 1 3_ D_9 


346 


837 


I4ce21516 85786 fl 1 


350 


841 


hp2el0911_3349_cl_63 


419 


910 


06ep30223_ I 65 I 2_c3_ 1 60 


460 


951 


14ce6l516 13073577 Q 12 


472 


963 


07ee50709 4818967 fl 43 


1000 


1054 


05ae30220 976413 c3 204 


1004 


1058 


07ee50709 4818967 f2 43 


1020 


1074 


hp7el0590J3073577_c3_107 


1293 


1296 


B.8 Other cytoplasmic proteins 






04ge 1 08 1 6_2208653 1 _f2_ 1 0 


318 


809 


05cp2 1 223_472 5443 J3_ 1 4 


322 


813 


06apl 1 1 19J24426508J3_26 


324 


815 


1 2ge 1 03 2 1 _482 1 082_f3_ 1 4 


330 


R? i 

Ox. 1 


hp3p 1 0807^ 1 89075_f2_4 


347 


Rift 

O J o 


02ae310IO 2117087 O 34 


440 


Q1 1 


03ael0804 21698400 c2 32 


446 


~j i 


06gp71906 255041 87_f3_l 12 


464 


QSS 


hp7p 10290 35 1 56558J3 J 5 


490 


981 


hp7p 1 0290_435 1 7 1 8_f 1 6 


491 


982 


I3ael0610 859692 c2 32 


995 


1049 


06apl 1 1 19_24426508_D_27 


997 


ins i 


hp3e 1 1 1 88_47327_f2_9 


1005 


msQ 


07ee50709 26438968 f2 36 


1028 


1082 


C. SECRETED PROTEINS 






C. 1 Proteins involved in secretion and adhesion 






1 2ap 1 0324^48053 1 8_f2_3 


355 


846 


12apl0324 48053 I8_f2_6 


1006 


1060 


C.2 Other secreted proteins 






Olgpl 1016_4103403_c2_13 


257 


748 


02aell612 1074212 fl 1 


258 


749 


02aell612 23598175 fl 2 


260 


751 


02aell612 33203250 cl 51 


261 


752 


02gp20706 J 203402_c3_58 


264 


755 


02gp20706_l 578 1 452_c2_5 1 


265 


756 


02gp20706_4892558J3_19 


268 


759 


04cpl 1 202 2426 1 588J2_23 


270 


761 


05ae30220 21619067 O 56 


272 


763 


07ap 1121 3_3540 1 528_c 1 _2 1 


273 


764 


05ae30220 24882812 c3 103 


274 


765 


05ae30220 25953163 c3 98 


275 


766 


05eel0816 14649077 D 18 


276 


767 


06apl 1 1 19_16594193_fl_9 


279 


770 


06cp3O603_4689068_c3_79 


283 


774 


07aplllll_234693_c3_14 


284 


775 


09cpl 1003J9532625_c3_17 


287 


778 


09cp20502 JS400 1 3 88_c 1 _3 1 


288 


779 


12gp3 1 106_3024126_f2_25 


289 


780 


I3ael0712 29569208 c2 27 


291 


782 


hp2p 1 0272_23697200_D_22 


295 


786 


hp2p 1 0272_26829 1 36_ f 1 _ 1 


296 


787 
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hp5el52ll_819455_c2_24 


297 


788 


hp5p 1 52 1 2_34064750_f2_9 


298 


789 


hp6e 1 0967_23476502_f2_6 


300 


791 


hp6e 1 0967_24882750_f2_7 


301 


792 


hp6e 1 2267 48767 1 8_f2_23 


302 


793 


hp6e20339J I90660_c2_46 


303 


794 


hp6e20339_2 1 492 1 87_c 1 _40 


304 


795 


hp6e20339_34024 1 87_c I _37 


305 


796 


04cp 1 1 202_ 1 6603425_c2_72 


314 


805 


04cpl I202_19797l28_fl_5 


315 


806 


05eel0816 259703 f2 7 


323 


814 


hp2p 1 0272_22692325_f3_2 1 


338 


829 


hp6e20339_243 1 7062_c3_57 


342 


833 


02ge 1 1 622_2 1 695936_c I _54 


348 


839 


1 2ge 1 032 1 _243085 1 3_f3_20 


349 


840 


I4ee4I924 2458267 c2 93 


356 


847 


01celll04 36125337 cl 8 


358 


849 


01ce21104 33203250 c3 87 


359 


850 


02ae31010 34616666 f2 27 


360 


851 


02ae31010 35270000 O 33 


361 


852 


02ae3l010 36132785 fl 29 


362 


853 


02gel01 16_15781452_cl_87 


363 


854 


03ae 1 0804_23485968_c3_47 


367 


858 


06ce206 1 0_29298537_c2_32 


370 


861 


06ce20610 3913967 c3 36 


371 


862 


06cpllll8_212827_cl_17 


373 


864 


06cp30603_2 1 492 1 87_f2_4 1 


377 


868 


06cp30603_34024 1 87_f 1 _20 


378 


869 


06cp30603_34024 1 87_fl _20 


379 


870 


06ep 106151 4649077_O_52 


380 


871 


06ep30223_527 1 902_c I_ 1 06 


388 


879 


06gp7 1 906_2426 1 588_c2J 74 


390 


881 


09cel0413 414011 fl 3 


394 


885 


09cel0413 5865665 fl 4 


395 


886 


09ce52017 29324062 cl 21 


396 


887 


09cp2 1 607 7224 1 87_c2_ 1 2 


401 


892 


09cp6 1 003_19532625_c I _78 


403 


894 


09cp61003_24335762_c3_1 1 1 


405 


896 


llae80818 783127 c3 63 


408 


899 


hp4e 1 3394_35957200_fl_2 1 


420 


911 


hp5 p 1 5 5 75_6 1 407 1 3 J2_ 1 8 


426 


917 


hp5p!5641_24304527_c3_35 
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432 


923 


02ae31010 16833312 f2 19 
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930 
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471 
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hp2e 1 09 1 1 _4882027_c2_87 


474 


965 


hp5p 1 52 J 2_6928 1 32_c3_34 


478 


969 


hp7p!0290_25548812_f3_14 


488 


979 


07ee50709 10213593 f3 77 


986 


1040 
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1 A44 


06gp7l 906^3024 I26_cl 128 


QQ 1 


I VkA -\ 
1 SJH J 


09cp 1 07 1 3_34024 1 87_f 1 3 1 


"/j 


f 047 

I W*T / 


02apl 1 1 I7_23495187_c3_81 


1001 


1 O^ *i 


09cp 1 07 ( 3_34024 1 87_f 1 _3 1 


lorn 


1 OSA 
1 UjO 


hp I e80523_23485968_c2_49 


Iflfl7 
I uu / 


1 A£ 1 


09cel0413 5865665 fl 4 


1 ft 1 -5 
IUI J 


1 flA"? 
1 UO / 


01ce6]016 23609580 c3 139 


1 U 1 O 


1 ATA 
10/0 


14cpl)908 783127 cl 72 


1 fkl 9 
I Ul O 


1072 


hD4e 13394 5088562 f3 54 




1075 


hD8e 10080 19546933 c2 88 


lUZO 


1080 


07ee50709 960952 f2 47 


1035 


1089 


D OTHER CELLULAR PROTFINS 






01eeM6I9 23711062 c3 14 


Zoo 


747 


02eo20706 23866562 c2 53 


zo/ 


758 


06cd30603 23476568 cl 44 


*> o*> 


773 


01eel02O3 35281542 cl 16 


iUo 


797 


01 eel 0203 860166 H 9 


TAT 

JU / 


798 


01 eel 1619 117RR141 r? 11 


308 


799 


01ffell619 24415RR0 c? 17 


joy 


AAA 
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808 
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hp7pl0287 24611325 c2 24 


487 


978 


hp7pl0290 25585941 O 12 


489 


980 


02gel0116 23866562 c3 146 


984 
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hp4p62853 5914693 c3_52 


994 


1048 


07cel0312 4554652 f3 2 
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1078 


hp6p 12244 3948467 c3 88 


1036 


1090 



[In Tabic 1, "nt" represents nucleotide Seq. ID number and "aa" represents amino acid Seq. ID 
number] 



5 Definitions 

A purified preparation or a substantially pure preparation of a polypeptide, as 
used herein, means a polypeptide that has been separated from other proteins, lipids, and 
nucleic acids with which it naturally occurs. Preferably, the polypeptide is also 
separated from substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are 

10 used to purify it. Preferably, the polypeptide constitutes at least 1 0, 20, 50 70, 80 or 
95% dry weight of the purified preparation. Preferably, the preparation contains: 
sufficient polypeptide to allow protein sequencing; at least 1, 10, or 100 jag of the 
polypeptide; at least 1, 10, or 100 mg of the polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal cells, an in 

1 5 vitro preparation of cells and not an entire intact plant or animal. In the case of cultured 
cells or microbial cells, it consists of a preparation of at least 1 0% and more preferably 
50% of the subject cells. 

A substantially pure nucleic acid, e.g., a substantially pure DNA, is a nucleic 
acid which is one or both of the following: not immediately contiguous with both of the 

20 coding sequences with which it is immediately contiguous (i.e., one at the 5' end and one 
at the 3' end) in the naturally-occurring genome of the organism from which the nucleic 
acid is derived; or which is substantially free of a nucleic acid with which it occurs in the 
organism from which the nucleic acid is derived. The term includes, for example, a 
recombinant DNA which is incorporated into a vector, e.g., into an autonomously 

25 replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or 
which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment 
produced by PCR or restriction endonuclease treatment) independent of other DNA 
sequences. Substantially pure DNA also includes a recombinant DNA which is part of a 
hybrid gene encoding additional H. pylori DNA sequence. 

30 A "contig" as used herein is a nucleic acid representing a continuous stretch of 

genomic sequence of an organism. 
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An "open reading frame", also referred to herein as ORF, is a region of nucleic 
acid which encodes a polypeptide. This region may represent a portion of a coding 
sequence or a total sequence. 

As used herein, a "coding sequence" is a nucleic acid which is transcribed into 
5 messenger RNA and/or translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a translation start codon at the five prime terminus and a translation stop 
code at the three prime terminus. A coding sequence can include but is not limited to 
messenger RNA, synthetic DNA, and recombinant nucleic acid sequences. 
10 A "complement" of a nucleic acid as used herein referes to an anti-parallel or 

an ti sense sequence that participates in Watson-Crick base-pairing with the original 
sequence. 

A "gene product" is a protein or structural RNA which is specifically encoded for 
by a gene. 

1 5 As used herein, the term "probe" refers to a nucleic acid, peptide or other 

chemical entity which specifically binds to a molecule of interest. Probes are often 
associated with or capable of associating with a label. A label is a chemical moiety 
capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and 
chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification 

20 sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which 
specifically binds to a molecule of interest and immobilizes such molecule is referred 
herein as a "capture ligand". Capture ligands are typically associated with or capable of 
associating with a support such as nitro-cellulose, glass, nylon membranes, beads, 
particles and the like. The specificity of hybridization is dependent on conditions such 

25 as the base pair composition of the nucleotides, and the temperature and salt 

concentration of the reaction. These conditions are readily discernable to one of 
ordinary skill in the art using routine experimentation. 

Homologous refers to the sequence similarity or sequence identity between two 
polypeptides or between two nucleic acid molecules. When a position in both of the two 

30 compared sequences is occupied by the same base or amino acid monomer subunit, e.g., 
if a position in each of two DNA molecules is occupied by adenine, then the molecules 
are homologous at that position. The percent of homology between two sequences is a 
function of the number of matching or homologous positions shared by the two 
sequences divided by the number of positions compared x 100. For example, if 6 of 10 
35 of the positions in two sequences are matched or homologous then the two sequences are 
60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC 
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sharc 50% homology. Generally, a comparison is made when two sequences are aligned 
to give maximum homology. 

Nucleic acids are hybridizable to each other when at least one strand of a nucleic 
acid can anneal to the other nucleic acid under defined stringency conditions. 
5 Stringency of hybridization is determined by: (a) the temperature at which hybridization 
and/or washing is performed; and (b) the ionic strength and polarity of the hybridization 
and washing solutions. Hybridization requires that the two nucleic acids contain 
complementary sequences; depending on the stringency of hybridization, however, 
mismatches may be tolerated. Typically, hybridization of two sequences at high 

10 stingency (such as, for example, in a solution of 0.5X SSC, at 65° C) requires that the 

sequences be essentially completely homologous. Conditions of intermediate stringency 
(such as, for example, 2X SSC at 65 ° C) and low stringency (such as, for example 2X 
SSC at 55° C), require correspondingly less overall complementarity between the 
hybridizing sequences. (IX SSC is 0.15 M NaCl, 0.01 5 M Na citrate). 

1 5 The terms peptides, proteins, and polypeptides are used interchangeably herein. 

As used herein, the term "surface protein" refers to all surface accessible 
proteins, e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and 
secreted proteins. 

A polypeptide has H. pylori biological activity if it has one, two and preferably 

20 more of the following properties: (1) if when expressed in the course of an H. pylori 
infection, it can promote, or mediate the attachment of H. pylori to a cell; (2) it has an 
enzymatic activity characteristic of an H. pylori protein; (3) or the gene which encodes it 
can rescue a lethal mutation in an H. pylori gene. A polypeptide has biological activity 
if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the above- 

25 listed properties. 

A biologically active fragment or analog is one having an in vivo or in vitro 
activity which is characteristic of the H. pylori polypeptides shown in the Sequence 
Listing, or of other naturally occurring H. pylori polypeptides, e.g., one or more of the 
biological activities described herein. Especially preferred are fragments which exist in 

30 v/vo, e.g., fragments which arise from post transcriptional processing or which arise 
from translation of alternatively spliced RNA's. Fragments include those expressed in 
native or endogenous cells as well as those made in expression systems, e.g., in CHO 
cells. Because peptides such as H. pylori polypeptides often exhibit a range of 
physiological properties and because such properties may be attributable to different 

35 portions of the molecule, a useful H. pylori fragment or H. pylori analog is one which 
exhibits a biological activity in any biological assay for H. pylori activity. Most 
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preferably the fragment or analog possesses 10%, preferably 40%, more preferably 90% 
or greater of the activity of K pylori , in any in vivo or in vitro assay. 

Analogs can differ from naturally occurring H. pylori polypeptides in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 

5 include changes in acetylation, methylation, phosphorylation, carboxylation, or 

glycosylation. Preferred analogs include H. pylori polypeptides (or biologically active 
fragments thereof) whose sequences differ from the wild-type sequence by one or more 
conservative amino acid substitutions or by one or more non-conservative amino acid 
substitutions, deletions, or insertions which do not substantially diminish the biological 

10 activity of the H. pylori polypeptide. Conservative substitutions typically include the 

substitution of one amino acid for another with similar characteristics, e.g., substitutions 
within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, 
leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, 
arginine; and phenylalanine, tyrosine. Other conservative substitutions are outlined in 

15 Table 2 below. 



TABLE 2 

CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino 
Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, D-Met, D-Ile, Om, 
D-Orn 


| Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-GIu, Gin, D-Gln 


fl Aspartic Acid 


D 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, D-Gln 


| Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


1 Glutamine 


Q 


D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


| Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-Gln 


| Glycine 


G 


Ala, D-Ala, Pro, D-Pro, p-Ala, Acp 


II Isoleucine 


I 


D-Ile, Val, D-VaL Leu, D-Leu, Met, D-Met 


| Leucine 


L 


D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met 


1 Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, He, D-Ile, Om, 
D-Orn 


|| Methionine 


M 


D-Met, S-Me-Cys, lie, D-Ile, Leu, D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, Trans-3,4, or 
5-phenylproline, cis-3,4, or 5-phenylproline 


Proline 


P 


D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-l-oxazolidine-4- 
carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys. D-Cys 


| Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D-Met(0) ? Val, D-Val 


II Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


| Valine 


V 


D-VaL Leu. D-Leu. lie, D-Ile, Met. D-Met 
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Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs 
that include residues other than naturally occurring L-amino acids, e.g., D-amino acids 
5 or non-naturally occurring or synthetic amino acids, e.g., p or y amino acids; and cyclic 
analogs. 

As used herein, the term "fragment", as applied to an //. pylori analog, will 
ordinarily be at least about 20 residues, more typically at least about 40 residues, 
preferably at least about 60 residues in length. Fragments of H. pylori polypeptides can 

10 be generated by methods known to those skilled in the art. The ability of a candidate 
fragment to exhibit a biological activity of K pylori polypeptide can be assessed by 
methods known to those skilled in the art as described herein. Also included are H. 
pylori polypeptides containing residues that are not required for biological activity of the 
peptide or that result from alternative mRNA splicing or alternative protein processing 

1 5 events. 

An "immunogenic component" as used herein is a moiety, such as an H. pylori 
polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or 
cellular immune response in a host animal. 

An "antigenic component" as used herein is a moiety, such as an H. pylori 
20 polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody 
with sufficiently high affinity to form a detectable antigen-antibody complex. 

As used herein, the term "transgene" means a nucleic acid (encoding, e.g., one or 
more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the 
transgenic animal or cell into which it is introduced, or, is homologous to an endogenous 
25 gene of the transgenic animal or cell into which it is introduced, but which is designed to 
be inserted, or is inserted, into the cell's genome in such a way as to alter the genome of 
the cell into which it is inserted (e.g., it is inserted at a location which differs from that 
of the natural gene or its insertion results in a knockout). A transgene can include one or 
more transcriptional regulatory sequences and any other nucleic acid, such as introns, 
30 that may be necessary for optimal expression of the selected nucleic acid, all operably 
linked to the selected nucleic acid, and may include an enhancer sequence. 

As used herein, the term "transgenic cell" refers to a cell containing a transgene. 

As used herein, a "transgenic animal" is any animal in which one or more, and 
preferably essentially all, of the cells of the animal includes a transgene. The transgene 
35 can be introduced into the cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as by microinjection or by 
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infection with a recombinant virus. This molecule may be integrated within a 
chromosome, or it may be extrachromosomally replicating DNA. 

The term "antibody" as used herein is intended to include fragments thereof 
which are specifically reactive with H. pylori polypeptides. 
5 As used herein, the term "cell-specific promoter" means a DNA sequence that 

serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue. The term also covers so-called "leaky" promoters, which 
regulate expression of a selected DNA primarily in one tissue, but cause expression in 
10 other tissues as well. 

Misexpression, as used herein, refers to a non-wild type pattern of gene 
expression. It includes: expression at non-wild type levels, i.e., over or under 
expression; a pattern of expression that differs from wild type in terms of the time or 
stage at which the gene is expressed, e.g., increased or decreased expression (as 
1 5 compared with wild type) at a predetermined developmental period or stage; a pattern of 
expression that differs from wild type in terms of decreased expression (as compared 
with wild type) in a predetermined cell type or tissue type; a pattern of expression that 
differs from wild type in terms of the splicing size, amino acid sequence, post- 
transitional modification, or biological activity of the expressed polypeptide; a pattern of 
20 expression that differs from wild type in terms of the effect of an environmental stimulus 
or extracellular stimulus on expression of the gene, e.g., a pattern of increased or 
decreased expression (as compared with wild type) in the presence of an increase or 
decrease in the strength of the stimulus. 

As used herein, "host cells" and other such terms denoting microorganisms or 
25 higher eukaryotic cell lines cultured as unicellular entities refers to cells which can 

become or have been used as recipients for a recombinant vector or other transfer DNA, 
and include the progeny of the original cell which has been transfected. It is understood 
by individuals skilled in the art that the progeny of a single parental cell may not 
necessarily be completely identical in genomic or total DNA compliment to the original 
30 parent, due to accident or deliberate mutation. 

As used herein, the term "control sequence" refers to a nucleic acid having a base 
sequence which is recognized by the host organism to effect the expression of encoded 
sequences to which they are ligated. The nature of such control sequences differs 
depending upon the host organism; in prokaryotes, such control sequences generally 
35 include a promoter, ribosomal binding site and terminators; in eukaryotes, generally 

such control sequences include promoters, terminators and in some instances, enhancers. 
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The term control sequence is intended to include at a minimum, all components whose 
presence is necessary for expression, and may also include additional components whose 
presence is advantageous, for example, leader sequences. 

As used herein, the term "operably linked" refers to sequences joined or ligated 
5 to function in their intended manner. For example, a control sequence is operably linked 
to coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

The metabolism of a substance, as used herein, means any aspect of the, 
expression, function, action, or regulation of the substance. The metabolism of a 

10 substance includes modifications, e.g., covalent or non-covalent modifications of the 

substance. The metabolism of a substance includes modifications, e.g., covalent or non- 
covalent modification, the substance induces in other substances. The metabolism of a 
substance also includes changes in the distribution of the substance. The metabolism of 
a substance includes changes the substance induces in the distribution of other 

1 5 substances. 

A "sample" as used herein refers to a biological sample, such as, for example, 
tissue or fluid isloated from an individual (including without limitation plasma, serum, 
cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture 
constituents, as well as samples from the environment. 

20 The practice of the invention will employ, unless otherwise indicated, 

conventional techniques of chemistry, molecular biology, microbiology, recombinant 
DNA, and immunology, which are within the skill of the art. Such techniques are 
explained fully in the literature. See e.g., Sambrook, Fritsch, and Maniatis, Molecular 
Cloning; Laboratory Manual 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N 

25 Glover ed. 1 985); Oligonucleotide Synthesis (M.J. Gait ed, 1984); Nucleic Acid 
Hybridization (B.D. Hames & S.J. Higgins eds. 1984); the series, Methods in 
Enzymoloqy (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and 
Grossman, eds.) and PCR-A Practical Approach (McPherson, Quirke, and Taylor, eds., 
1991). 

30 

I. Isolation of Nucleic Acids of H. pylori and Uses Therefor 
//. pylori Genomic Seouence 

This invention provides nucleotide sequences of the genome of//, pylori which 
thus comprises a DNA sequence library of//, pylori genomic DNA. The detailed 
35 description that follows provides nucleotide sequences of H. pylori^ and also describes 
how the sequences were obtained and how ORFs and protein-coding sequences were 
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identified. Also described are methods of using the disclosed H. pylori sequences in 
methods including diagnostic and therapeutic applications. Furthermore, the library can 
be used as a database for identification and comparison of medically important 
sequences in this and other strains of//, pylori. 

5 To determine the genomic sequence of H. pylori, DNA was isolated from a strain 

of//, pylori and mechanically sheared by nebulization to a median size of 2 kb. 
Following size fractionation by gel electrophoresis, the fragments were blunt-ended, 
ligated to adapter oligonucleotides, and cloned into each of 20 different pMPX vectors 
(Rice et ah, abstracts of Meeting of Genome Mapping and Sequencing, Cold Spring 

10 Harbor, NY, 5/1 1-5/15, 1994, p. 225) to construct a series of "shotgun" subclone 
libraries. 

DNA sequencing was achieved using multiplex sequencing procedures 
essentially as disclosed in Church et al., 1988, Science 240:185; U.S. Patents No. 
4,942,124 and 5,149,625). DNA was extracted from pooled cultures and subjected to 
1 5 chemical or enzymatic sequencing. Sequencing reactions were resolved by 

electrophoresis, and the products were transferred and covalently bound to nylon 
membranes. Finally, the membranes were sequentially hybridized with a series of 
labelled oligonucleotides complimentary to "tag" sequences present in the different 
shotgun cloning vectors. In this manner, a large number of sequences could be obtained 
20 from a single set of sequencing reactions. The cloning and sequencing procedures are 
described in more detail in the Exemplification. 

Individual sequence reads obtained in this manner were assembled using the 
FALCON™ program (Church et al , 1994, Automated DNA Sequencing and Analysis, 
J.C. Venter, ed., Academic Press) and PHRAP (P. Green, Abstracts of DOE Human 
25 Genome Program Contractor-Grantee Workshop V, Jan. 1996, p.l 57). A resulting 

assembly of contigs, each representing a continuous stretch of DNA or DNA sequence 
was obtained. The average contig length was about 3 kb. 

A variety of approaches are used to order the contigs so as to obtain a continuous 
sequence representing the entire H. pylori genome. Synthetic oligonucleotides are 
30 designed that are complementary to sequences at the end of each contig. These 
oligonucleotides may be hybridized to libaries of H. pylori genomic DNA in, for 
example, lambda phage vectors or plasmid vectors to identify clones that contain 
sequences corresponding to the junctional regions between individual contigs. Such 
clones are then used to isolate template DNA and the same oligonucleotides are used as 
35 primers in polymerase chain reaction (PCR) to amplify junctional fragments, the 
nucleotide sequence of which was then determined. 
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The H. pylori sequences were analyzed for the presence of open reading frames 
(ORFs) comprising at least 180 nucleotides. ORFs of at least 180 nucleotides (based on 
stop-to-stop codon reads) were predicted. As a result of the analysis of ORFs based on 
stop-to-stop codon reads, it should be understood that these ORFs may not correspond to 
5 the ORF of a naturally-occurring H. pylori polypeptide. These ORFs may contain start 
codons which indicate the initiation of protein synthesis of a naturally-occurring H. 
/?>//or/polypeptide. Such start codons within the ORFs provided herein can be identified 
by those of ordinary skill in the relevant art and the resulting ORF and the encoded H. 
pylori polypeptide is within the scope of this invention. For example, within the ORFs a 

10 codon such as AUG or GUG (encoding methionine or valine) which is part of the 
initiation signal for protein synthesis can be identified and the ORF modified to 
correspond to a naturally-occurring K pylori polypeptide. The predicted coding regions 
were defined by evaluating the coding potential of such sequences with the program 
GENEMARK™ (Borodovsky and Mclninch, 1993, Comp. Chem. 17:123). 

15 Other H. pylori Nucleic Acids 

The nucleic acids of this invention may be obtained directly from the DNA of the 
above referenced K pylori strain by using the polymerase chain reaction (PCR). See 
"PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, 
UK, 1991) for details about the PCR. High fidelity PCR can be used to ensure a faithful 

20 DNA copy prior to expression. In addition, amplified products can be checked by 

conventional sequencing methods. Clones carrying the desired sequences described in 
this invention may be obtained by screening the libraries by means of the PCR or by 
hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or 
plaques as known in the art (see, e.g., Sambrook et aL, Molecular Cloning, A Laboratory* 

25 Manual 2nd edition, 1 989, Cold Spring Harbor Press, NY). 

It is also be possible to obtain nucleic acids encoding H. pylori polypeptides 
from a cDNA library in accordance with protocols herein described. A cDNA encoding 
an K pylori polypeptide can be obtained by isolating total mRNA from an appropriate 
cell line. Double stranded cDNAs can then be prepared from the total mRNA. 

30 Subsequently, the cDNAs can be inserted into a suitable plasmid or viral (e.g., 

bacteriophage) vector using any one of a number of known techniques. Genes encoding 
K pylori polypeptides can also be cloned using established polymerase chain reaction 
techniques in accordance with the nucleotide sequence information provided by the 
invention. The nucleic acids of the invention can be DNA or RNA. Preferred nucleic 

35 acids are shown in the Sequence Listing, 
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The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotidcs 
are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See e.g., Itakura et ah U.S. 
5 Patent No. 4,598,049; Caruthers et ah U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

Nucleic acids isolated or synthesized in accordance with features of the present 
invention are useful, by way of example, without limitation, as probes, primers, capture 
ligands, antisense genes and for developing expression systems for the synthesis of 
10 proteins and peptides corresponding to such sequences. As probes, primers, capture 
ligands and antisense agents, the nucleic acid normally consists of all or part 
(approximately twenty or more nucleotides for specificity as well as the ability to form 
stable hybridization products) of the nucleic acids shown in the Sequence Listing. These 
uses are described in further detail below. 
15 Probes 

A nucleic acid isolated or synthesized in accordance with the nucleotide 
sequences set forth in the Sequence Listing can be used as a probe to specifically detect 
H. pylori. With the sequence information set forth in the present application, sequences 
of twenty or more nucleotides are identified which provide the desired inclusivity and 
20 exclusivity with respect to H. pylori^ and extraneous nucleic acids likely to be 

encountered during hybridization conditions. More preferably, the sequence will 
comprise at least twenty to thirty nucleotides to convey stability to the hybridization 
product formed between the probe and the intended target molecules. 

Sequences larger than 1000 nucleotides in length are difficult to synthesize but 
25 can be generated by recombinant DNA techniques. Individuals skilled in the art will 

readily recognize that the nucleic acids, for use as probes, can be provided with a label to 
facilitate detection of a hybridization product. 

Nucleic acid isolated and synthesized in accordance with the Sequence Listing 
can also be useful as probes to detect homologous regions (especially homologous 
30 genes) of other Helicobacter species using appropriate stringency hybridization 
conditions as described herein. 
Capture Ligand 

For use as a capture ligand, the nucleic acid selected in the manner described 
above with respect to probes, can be readily associated with a support. The manner in 
35 which nucleic acid is associated with supports is well known. Nucleic acid having 

twenty or more nucleotides in a sequence contained in the Sequence Listing have utility 
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to separate H pylori nucleic acid from the nucleic acid of each other and other 
organisms. Nucleic acid having twenty or more nucleotides in a sequence shown in the 
Sequence Listing can also have utility to separate other Helicobacter species from each 
other and from other organisms. Preferably, the sequence will comprise at least twenty 
5 nucleotides to convey stability to the hybridization product formed between the probe 
and the intended target molecules. Sequences larger than 1000 nucleotides in length are 
difficult to synthesize but can be generated by recombinant DNA techniques. 
Primers 

Nucleic acid isolated or synthesized in accordance with the sequences described 

10 herein have utility as primers for the amplification of H pylori nucleic acid. These 
nucleic acids may also have utility as primers for the amplification of nucleic acids in 
other Helicobacter species. With respect to polymerase chain reaction (PCR) 
techniques, nucleic acids of > 10-15 nucleotides contained in the Sequence Listing have 
utility in conjunction with suitable enzymes and reagents to create copies of H pylori 

1 5 nucleic acid. More preferably, the sequence will comprise twenty or more nucleotides to 
convey stability to the hybridization product formed between the primer and the 
intended target molecules. Binding conditions of primers greater than 1 00 nucleotides 
are more difficult to control to obtain specificity. High fidelity PCR can be used to 
ensure a faithful DNA copy prior to expression. In addition, amplified products can be 

20 checked by conventional sequencing methods. 

The copies can be used in diagnostic assays to detect specific sequences, 
including genes from H pylori and/or other Helicobacter species. The copies can also 
be incorporated into cloning and expression vectors to generate polypeptides 
corresponding to the nucleic acid synthesized by PCR, as is described in greater detail 

25 herein. 

Antisense 

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 
accordance with the sequences described herein have utility as antisense agents to 
prevent the expression of//, pylori genes. These sequences also have utility as antisense 

30 agents to prevent expression of genes of other Helicobacter species. 

In one embodiment, nucleic acid or derivatives corresponding to H pylori 
nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for 
introduction into bacterial cells. For example, a nucleic acid having twenty or more 
nucleotides is capable of binding to bacteria nucleic acid or bacteria messenger RNA. 

35 Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide 
necessary stability of a hybridization product of non-naturally occurring nucleic acid and 
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bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence 
greater than 1000 nucleotides in length is difficult to synthesize but can be generated by 
recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes 
is known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980 
5 to Papahadjopoulos et al. 

II. Expression of H. pylori Nucleic Acids 

Nucleic acid isolated or synthesized in accordance with the sequences described 
herein have utility to generate polypeptides. The nucleic acids exemplified in the 
10 Sequence Listing or fragments of said nucleic acid encoding active portions of H. pylori 
polypeptides can be cloned into suitable vectors or used to isolate nucleic acid. The 
isolated nucleic acid is combined with suitable DNA linkers and cloned into a suitable 
vector. 

The function of a specific gene or operon can be ascertained by expression in a 

15 bacterial strain under conditions where the activity of the gene product(s) specified by 
the gene or operon in question can be specifically measured. Alternatively, a gene 
product may be produced in large quantities in an expressing strain for use as an antigen, 
an industrial reagent, for structural studies, etc. This expression can be accomplished in 
a mutant strain which lacks the activity of the gene to be tested, or in a strain that does 

20 not produce the same gene product(s). This includes, but is not limited to other 
Helicobacter strains, and other bacterial strains such as E. coli, Norcardia, 
Corynebacterium, and Streptomyces species. In some cases the expression host will 
utilize the natural Helicobacter promoter whereas in others, it will be necessary to drive 
the gene with a promoter sequence derived from the expressing organism (e.g., an E. 

25 coli beta-galactosidase promoter for expression in E. coli). 

To express a gene product using the natural H, pylori promoter, a procedure such 
as the following can be used. A restriction fragment containing the gene of interest, 
together with its associated natural promoter element and regulatory sequences 
(identified using the DNA sequence data) is cloned into an appropriate recombinant 

30 plasmid containing an origin of replication that functions in the host organism and an 
appropriate selectable marker. This can be accomplished by a number of procedures 
known to those skilled in the art. It is most preferably done by cutting the plasmid and 
the fragment to be cloned with the same restriction enzyme to produce compatible ends 
that can be ligated to join the two pieces together. The recombinant plasmid is intro- 

35 duced into the host organism by, for example, electroporation and cells containing the 
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recombinant plasmid are identified by selection for the marker on the plasmid. Express- 
ion of the desired gene product is detected using an assay specific for that gene product. 

In the case of a gene that requires a different promoter, the body of the gene 
(coding sequence) is specifically excised and cloned into an appropriate expression 
5 plasmid. This subcloning can be done by several methods, but is most easily 
accomplished by PCR amplification of a specific fragment and ligation into an 
expression plasmid after treating the PCR product with a restriction enzyme or 
exonuclease to create suitable ends for cloning. 

A suitable host cell for expression of a gene can be any procaryotic or eucaryotic 

1 0 cell. For example, an H. pylori polypeptide can be expressed in bacterial cells such as E. 
coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster ovary 
cell (CHO). Other suitable host cells are known to those skilled in the art. 

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead 
to partial or complete glycosylation and/or formation of relevant inter- or intra-chain 

15 disulfide bonds of a recombinant peptide product. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari. et al., (1987) Embo J. 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et ah, (1987) Gene 
54: 11 3- 123), and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors 
available for expression of proteins in cultured insect cells (SF 9 cells) include the pAc 

20 series (Smith et al., (1983) Mol Cell Biol. 3:2156-2165) and the pVL series (Lucklow, 
V.A., and Summers, M.D., (1989) Virology J70:31-39). Generally, COS cells 
(Gluzman, Y., (1981) Cell 23:175-182) are used in conjunction with such vectors as 
pCDM 8 (Aruffo, A. and Seed, B., (1987) Proc. NatL Acad. Sci. USA 84:8573-8577) for 
transient amplification/expression in mammalian cells, while CHO (dhfr Chinese 

25 Hamster Ovary) cells are used with vectors such as pMT2PC (Kaufman et aL (1 987), 
EMBO J. 6:187-195) for stable amplification/expression in mammalian cells. Vector 
DNA can be introduced into mammalian cells via conventional techniques such as 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, or electroporation. Suitable methods for transforming host cells can be 

30 found in Sambrook et al. ( Molecular Cloning: A Laboratory Manual . 2nd Edition, Cold 
Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

Expression in procaryotes is most often carried out in E. coli with either fusion 
or non-fusion inducible expression vectors. Fusion vectors usually add a number of 
NH2 terminal amino acids to the expressed target gene. These NH2 terminal amino 

35 acids often are referred to as a reporter group. Such reporter groups usually serve two 
purposes: 1 ) to increase the solubility of the target recombinant protein; and 2) to aid in 
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the purification of the target recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the reporter group and the target recombinant protein to 
enable separation of the target recombinant protein from the reporter group subsequent 
5 to purification of the fusion protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England 
Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione 
S-transferase, maltose E binding protein, or protein A, respectively, to the target 
1 0 recombinant protein. A preferred reporter group is poly(His), which may be fused to the 
amino or carboxy terminus of the protein and which renders the recombinant fusion 
protein easily purifiable by metal chelate chromatography. 

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene 
69:301-315) and pETl Id (Studier et aL, Gene Expression Technology: Methods in 
15 Enzymology 185 . Academic Press, San Diego, California (1990) 60-89). While target 
gene expression relies on host RNA polymerase transcription from the hybrid trp-lac 
fusion promoter in pTrc, expression of target genes inserted into pETl Id relies on 
transcription from the T7 gnlO-lac 0 fusion promoter mediated by coexpressed viral 
RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains 
20 BL2 1 (DE3) or HMS 1 74(DE3) from a resident X prophage harboring a T7 gnl under the 
transcriptional control of the lacUV 5 promoter. 

For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding an K pylori polypeptide can be cultured 
under appropriate conditions to allow expression of the polypeptide to occur. The 
25 polypeptide may be secreted and isolated from a mixture of cells and medium containing 
the peptide. Alternatively, the polypeptide may be retained cytoplasmically and the cells 
harvested, lysed and the protein isolated. A cell culture includes host cells, media and 
other byproducts. Suitable media for cell culture are well known in the art. 
Polypeptides of the invention can be isolated from cell culture medium, host cells, or 
30 both using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaffinity purification with antibodies specific for such polypeptides. 
Additionally, in many situations, polypeptides can be produced by chemical cleavage of 
a native protein (e.g., tryptic digestion) and the cleavage products can then be purified 
35 by standard techniques. 
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In the case of membrane bound proteins, these can be isolated from a host cell by 
contacting a membrane-associated protein fraction with a detergent forming a 
solubilized complex, where the membrane-associated protein is no longer entirely 
embedded in the membrane fraction and is solubilized at least to an extent which allows 
5 it to be chromatographically isolated from the membrane fraction. Several different 
criteria are used for choosing a detergent suitable for solubilizing these complex. For 
example, one property considered is the ability of the detergent to solubilize the H. 
pylori protein within the membrane fraction at minimal denaturation of the membrane- 
associated protein allowing for the activity or functionality of the membrane-associated 

1 0 protein to return upon reconstitution of the protein. Another property considered when 
selecting the detergent is the critical micells concentration (CMC) of the detergent in 
that the detergent of choice preferably has a high CMC value allowing for ease of 
removal after reconstitution. A third property considered when selecting a detergent is 
the hydrophobicity of the detergent. Typically, membrane-associated proteins are very 

15 hydrophobic and therefore detergents which are also hydrophobic, e.g., the triton series, 
would be useful for solubilizing the hydrophobic proteins. Another property important 
to a detergent can be the capability of the detergent to remove the H. pylori protein with 
minimal protein-protein interaction facilitating further purification. A fifth property of 
the detergent which should be considered is the charge of the detergent. For example, if 

20 it is desired to use ion exchange resins in the purification process then preferably 

detergent should be an uncharged detergent. Chromatographic techniques which can be 
used in the final purification step are known in the art and include hydrophobic 
interaction, lectin affinity, ion exchange, dye affinity and immunoaffinity. 

One strategy to maximize recombinant K pylori peptide expression in E. coli is 

25 to express the protein in a host bacteria with an impaired capacity to proteolytically 

cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods 
in Enzvmology 185. Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy would be to alter the nucleic acid encoding K pylori peptide to be inserted into 
an expression vector so that the individual codons for each amino acid would be those 

30 preferentially utilized in highly expressed E. coli proteins (Wada et al., ( 1 992) Nuc. 
Acids Res. 20:2 111-2118). Such alteration of nucleic acids of the invention can be 
carried out by standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 

35 are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. 
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Patent No. 4,598,049; Camthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. 
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

III. H. pylori Polypeptides 
5 This invention encompasses isolated H. pylori polypeptides encoded by the 

disclosed H. pylori genomic sequences, including the polypeptides contained in the 
Sequence Listing. Polypeptides of the invention are preferably at least 5 amino acid 
residues in length. Using the DNA sequence information provided herein, the amino 
acid sequences of the polypeptides encompassed by the invention can be deduced using 

10 methods well-known in the art. It will be understood that the sequence of an entire 

nucleic acid encoding an H. pylori polypeptide can be isolatedand identified based on an 
ORF that endoes only a fragment of the cognate protein-coding region. This can be 
acheived, for example, by using the isolated nucleic acid encoding the ORF, or 
fragments thereof, to prime a polymerase chain reaction with genomic H. pylori DNA as 

1 5 template; this is followed by sequencing the amplified product. 

The polypeptides of the invention can be isolated from wild-type or mutant K 
pylori cells or from heterologous organisms or cells (including, but not limited to, 
bacteria, yeast, insect, plant and mammalian cells) into which an H. pylori nucleic acid 
has been introduced and expressed. In addition, the polypeptides can be part of 

20 recombinant fusion proteins. 

H. pylori polypeptides of the invention can be chemically synthesized using 
commercially automated procedures such as those referenced herein. 

Many of the polypeptides of the invention are related to one another. Some of 
these relationships are described in Tables 3-6 below. All of the polypeptide lengths in 

25 Table 3 are from stop codon to stop codon in the nucleotide sequence of H. pylori. As is 
known in the art, the actual polypeptide lengths are usually shorter than the stop-to-stop 
codon lengths because a start codon for an initiator charged tRNA usually appears a few 
nucleotides downstream from the prior stop codon and within a few nucleotides 
following a ribosome binding site (also known as a "Shine-Delgarno sequence"). Since 

30 most of the ribosome binding sites in K pylori have many of the same general features 
of those known in E. coli, one skilled in the art can predict the actual start codon with 
good reliability from the stop-to-stop nucleotide sequence of an open reading frame. 
The polypeptide sequences of SEQ ID NOs:492-743 of this invention represent the stop- 
to-stop codon lengths of the open reading frames of SEQ ID NOs: 1-252. All other 

35 polypeptide sequences of this invention represent the predicted start to stop protein 

lengths from the nucleotide sequences. One skilled in the art can recognize start sites in 
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the stop-to-stop open reading frames of the nucleotide sequences presented herein. In 
addition, one skilled in the art will occasionally detect alternative start sites, some of 
which may be utilized in vivo by the cellular machinery. The number of these 
alternative start sites is sufficiently small that one skilled in the art can readily test them 
5 in a recombinant expression systems known in the art to determine which ones provide 
authentic functional protein products. 

The relationship between the polypeptides shown in Table 3 can be described as 
follows. First, all of the polypeptides of Table 3 are at least 90% identical with each 
other over most of their lengths, and most are over 95% identical with each other. 

10 Second, the stop-to-stop lengths are different for some of the homologous pairs of 

polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the 
protein exhibiting utility in this invention; in some cases, the longer polypeptide may 
exhibit improved utility. Third, some polypeptides in the second column are 
homologous to two shorter polypeptides in the fifth column. 

15 In all cases, the homology relationships described in Table 3 are highly 

significant. For example, a typical H. pylori gene product will exhibit amino acid 
sequence identities of between 92% and 100% among different strains of//, pylori 
selected from human patients. The nucleotide sequences encoding the related 
polypeptides of this invention are also very similar to one another. For example, 

20 nucleotide probes derived from the coding sequence of a polypeptide of this invention 
can be used in PCR or hybridization experiments to identify clones carrying the 
nucleotide sequence encoding the homologous related polypeptide. 
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Additional relationships between polypeptides of the invention are described in 
Table 4 below. All of the polypeptide lengths in Table 4 below are measured from stop 
codon to stop codon in the nucleotide sequence of H. pylori. 

The relationship between the polypeptides shown in Table 4 can be described as 
5 follows. First, all of the polypeptides of Table 4 are at least 90% identical with each 
other over most of their lengths, and most are over 95% identical with each other. 
Second, the stop-to-stop lengths are different for some of the homologous pairs of 
polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the 
protein exhibiting utility in this invention; in some cases, the longer polypeptide may 
10 exhibit improved utility. Third, some polypeptides in the second column are 
homologous to two shorter polypeptides in the fifth column. 

In all cases, the homology relationships described in Table 4 are highly 
significant. For example, a typical H. pylori gene product will exhibit amino acid 
sequence identities of between 92% and 100% among different strains of K pylori 
1 5 selected from human patients. The nucleotide sequences encoding the related 

polypeptides of this invention are also very similar to one another. For example, 
nucleotide probes derived from the coding sequence of a polypeptide of this invention 
can be used in PCR or hybridization experiments to identify clones carrying the 
nucleotide sequence encoding the homologous related polypeptide. 

20 

TABLE 4 



ORF Name 


aa 
Seq 
ID# 


Length 
(aa) 


ORF Name 


aa 
Seq 
ID# 


Length 
(aa) 


% 
Iden- 
tity 


Over- 
lap 
aa) 


Change 

in 
Length 


0lcelll04 36125337 cl 8 


849 


74 


hplel0523orf3 


612 


63 


100 


63 


11 


01ce2M04 33203250 c3 87 


850 


509 


05cpll9!!orf35 


549 


206 


100 


204 


303 


01cp20708J36 134808 J2_l 1 


928 


230 


14eel0419orf5 


704 


229 


too 


229 


1 


02ae31010 12504512 f3 28 aa 


929 


490 


0lgel0801orfZ 


514 


60 


95 


56 


430 


02ae31010 16833312 £2 19 


930 


181 


02cell022orf2 


530 


184 


98 


181 


-3 


02ae31010 2117087 O 34 


931 


137 


07cel0203orfl7 


635 


76 


97 


77 


61 


02ae3 1 0 1 0 J302083 1 7_f 1 _ 1 4 


932 


343 


hplpl3947orfl0 


717 


343 


100 


343 


0 


02ae31010 34616666 f2 27 


851 


184 


Dlcell618orfl 


503 


124 


99 


125 


60 


02ae310IO 34616666 fl 27 


851 


184 


hplp!3947orfll 


616 


81 


94 


77 


103 


02ae3I010 35270000 G 33 


852 


231 


Olepll504orf5 


505 


156 


99 


156 


75 


02ae3101O 36132785 f2 29 


853 


438 


01ce]1618orf3 


499 


67 


98 


64 


371 


02ae31010 36132785 f2 29 


853 


438 


01cell618orfl3 


504 


378 


97 


387 


60 


02ae31010 5085162 cl 47 


934 


416 


07cel0203orfli 


634 


416 


100 


416 


0 


02cp 1 06 1 5_26573462_c 1 _45 


935 


168 


hp3pl0304orf2 


727 


126 


91 


129 


42 


02gel01 I6_1578l452_cl_87 


854 


97 


06gel0115orfl7 


563 


97 


100 


97 


0 


02ge 10 11 61 6803 5 1 3_f2_34 


855 


488 


01cpll710orfl6 


652 


64 


100 


62 


424 


02gel01 16J6803513_f2_34 


855 


488 


01cpll710orf5 


654 


146 


99 


145 


342 


02gel01 16JJ6367936_c)_92 


857 


173 


02ge20116orf25 


521 


173 


100 


173 


0 


03ae 10804 12609533 cl 26 


936 


257 


O6epl0306orfl0 


610 


257 


100 


257 


0 
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hp6p 1 0903_4398263 J3_6.aa 


924 


370 


05gpl 1901orfl9 


553 


370 


100 


370 


0 


hp6p!0903_4398263_O_6.aa 


924 


370 


05gpt I901orfl9 


553 


370 


100 


370 


0 


03ae 10804 21698400 c2 32 


937 


386 


06epl0306orfll 


611 


386 


100 


386 


0 


03ae 10804 23485968 c3 47 


858 


88 


06epl0306orf5 


561 


88 


100 


88 


0 


04ep4 1 903_2675793 7 J3_ 1 6 


938 


703 


29apl0306orD 


711 


325 


97 


319 


378 


04ep4 1 903__26757937 J3_l 6 


938 


703 


29apl !902orfl 


712 


280 


92 


280 


423 


04ep4 1 903_4 1 0 1 593_f2_ 1 0.aa 


940 


1240 


14ee21H8orfl 


706 


365 


96 


358 


875 


05ce 1 09 1 0_25598277_f3_3 


859 


258 


07eel l519orfl 


567 


170 


96 


166 


88 


05epl0815_26570332_c2_99 


941 


183 


12gel0610orf2 


702 


97 


98 


97 


86 


05ep 1 08 ! 5_4 1 95292_c 1 _84 


942 


44 i 


hplel0554orfl 


715 


103 


99 


94 


338 


05ep 1 08 1 5_47 1 9 1 75_c 1 _83 


943 


663 


06eeI0207orfl 


606 


99 


99 


99 


564 


06aell016 30579712 f2 21 


860 


239 


03ce21717orfl 


526 


193 


99 


193 


46 


06ce20610 1367157 fl 8 aa 


844 


248 


29gel0111orf3 


714 


124 


90 


121 


124 


O6ce20610 29298537 c2 32 


861 


234 


05gp201Uorf4 


555 


234 


100 


234 


0 


06ce206 1 0J39 1 3967_c3_36 


862 


283 


05gp201Ilorf6 


556 


283 


100 


283 


0 


06ce20610_4331338J3_18 


863 


353 


05gp20lllorfl2 


554 


243 


99 


245 


110 


06cpl 1 1 18_212827_cl J7 


864 


73 


06cpllll8orf7 


558 


73 


100 


73 


0 


06cpl 12 17_1 9720300 J3 J 1 


865 


308 


hplp!3852orf6 


614 


646 


96 


278 


-338 


06cp 11 2 1 7_488 1 263J2_9 


866 


93 


hplpl3852orf7 


615 


93 


100 


93 


0 


06cp 1121 7_4897077_f 1 jS 


867 


100 


hplpl3852orf4 


613 


100 


100 


100 


0 


06cp30603_2 1492 1 87 J2_4 1 


868 


106 


04gpl !213orfl4 


539 


136 


98 


106 


-30 


06cp30603_34024 1 87_n _20 


869 


483 


04gpll213orfll 


538 


122 


100 


122 


361 


06cp30603_34024 1 87_fl _20 


869 


483 


04gpll213orf5 


669 


382 


95 


377 


101 


06cp30603_67921 8_f2_34 


944 


380 


hp3el0302orfl7 


721 


231 


98 


236 


149 


06ep 1 06 1 5_ 1 4649077_O_52 


871 


300 


04cell617orf4 


534 


152 


96 


145 


148 


06ep 1 06 1 5_96 1 562_f2_4 1 


945 


636 


29ep20112orf2 


713 


135 


98 


87 


501 


06ep 1 06 1 5_9842_f3_46 


872 


159 


05epll717orf2 


552 


192 


99 


159 


-33 


06epl I202_133293_cl_19 


946 


167 


06cp20302orf3 


602 


167 


100 


167 


0 


06epl 1202_26353438_cl_22 


873 


286 


hp2ell858orf6 


623 


155 


100 


154 


131 


06epl 1202_26353438_cl_22 


873 


286 


02ae21214orfl 


522 


109 


92 


no 


177 


06epl 1202_792962_c2_26.aa 


949 


151 


06cp20302orf6 


604 


66 


92 


66 


85 


06epl 1202_4884677_cl_17 


948 


171 


06cp20302orf5 


603 


171 


99 


171 


0 


06ep 1191 7J24803 1 53_c3_24 


848 


409 


hp4e!4535orf3 


536 


253 


98 


239 


156 


06ep 1191 7_24803 1 53_c3_24 


848 


409 


hp4e!4535orf7 


730 


60 


97 


60 


349 


06ep30223_165 12_c3 J60 


951 


231 


hp3el0128orfl 


720 


231 


100 


231 


0 


06ep30223_23476067_cl J 19 


952 


151 


0lep30520orf8 


511 


94 


93 


95 


57 


06ep30223_23557202_c2_l 30 


874 


329 


hp3ell024orf49 


630 


329 


100 


329 


0 


06ep30223_3440943 7_f3_94 


875 


148 


hp3ell024orf5 


631 


148 


100 


148 


0 


06ep30223_4698838_f2_55 


845 


663 


14gpl20l5orfl3 


662 


344 


100 


334 


319 


06ep30223_4698838J2_55 


845 


663 


I4gpl2015orfl6 


663 


117 


99 


112 


546 


06ep30223_4876077_c3_ 1 49 


877 


113 


hp3ell024orf34 


629 


113 


100 


113 


0 


06ep30223_5 1 0944 3_c 1 J 09 


878 


342 


05eel0411orf5 


551 


293 


95 


287 


49 


06ep30223_527 1902_c J J 06 


879 


207 


hp2el0229orf4 


619 


80 


95 


38 


127 


06gp 1 0409_3398427J2_ 1 2 


880 


115 


13aet051lorG 


583 


115 


92 


115 


0 


06gp7 1 906_1 5 1 1 5637 J2_59 


953 


158 


hp4pll393orf2 


733 


158 


100 


158 


0 


06gp7 1 906_2426 1 588_c2_ 1 74 


881 


96 


0Iael2021orf5 


492 


96 


100 


96 


0 


06gp7 1 906_25478 1 92_c l_l 3 1 


954 


236 


hp3elll22orO 


723 


236 


100 


236 


0 


06gp7 1 906_25504 1 87J3J 1 2 


955 


443 


hp4pll393orfl 


732 


169 


91 


155 


274 


06gp7 1 906_970325_c3_ 1 90 


882 


158 


02ap21M3orf2 


512 


113 


92 


98 


45 


07ae 10923 24426508 fl 1 


883 


137 


Ilgel0309orf28 


685 


137 


100 


137 


0 


07ae 10923 24426508 fl 1 


883 


137 


1 Igel0309orf28 


685 


137 


100 


137 


0 


09cel0413 414011 fl 3 


885 


169 


02cel0216orf6 


516 


172 


100 


169 


-3 
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09cel0413 5865665 f] 4 


886 


353 


02cel0216orf7 


517 


74 


100 


74 


279 


09ce52017 29324062 cl 21 


887 


141 


01ep30520orO4 


506 


141 


100 


141 


o 


09cp 1 0224_ 1 062966_c3_6 1 


888 


367 


05cel0420orfl 


548 


194 


100 


194 


173 


09cp 1 0224_ 1 4 1 27 1 5_c3_56 


889 


185 


0lcel0516orf20 


495 


185 


100 


185 


o 


09cp 1 0224 4295 1 0_c2_46.aa 


890 
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Additional relationships between polypeptides of the invention are described in 
Table 5 below. All of the polypeptide lengths in TableS below are measured from start 
codon to stop codon in the nucleotide sequence of K pylori. 

The relationship between the polypeptides shown in Table 5 can be described as 
follows. First, all of the polypeptides of Table 5 are at least 90% identical with each 
other over most of their lengths, and most are over 95% identical with each other. 
Second, the start-to-stop lengths are different for some of the homologous pairs of 
polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the 
protein exhibiting utility in this invention; in some cases, the longer polypeptide may 
exhibit improved utility. Third, some polypeptides in the second column are 
homologous to two shorter polypeptides in the fifth column. 

In all cases, the homology relationships described in Table 5 are highly 
significant. For example, a typical K pylori gene product will exhibit amino acid 
sequence identities of between 92% and 100% among different strains of K pylori 
selected from human patients. The nucleotide sequences encoding the related 
polypeptides of this invention are also very similar to one another. For example, 
nucleotide probes derived from the coding sequence of a polypeptide of this invention 
can be used in PCR or hybridization experiments to identify clones carrying the 
nucleotide sequence encoding the homologous related polypeptide. 
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Additional relationships between polypeptides of the invention are described in 
Table 6 below. All of the polypeptide lengths in Table 6 below are measured from stop 
codon to stop codon in the nucleotide sequence of//, pylori. 

The relationship between the polypeptides shown in Table 6 can be described as 

5 follows. First, all of the polypeptides of Table 6 are at least 90% identical with each 
other over most of their lengths, and most are over 95% identical with each other. 
Second, the stop-to-stop lengths are different for the homologous pairs of polypeptides. 
In some cases, the shorter polypeptide contains the relevant portion of the protein 
exhibiting utility in this invention; in some cases, the longer polypeptide may exhibit 

10 improved utility. 

In all cases, the homology relationships described in Table 6 are highly 
significant. For example, a typical H. pylori gene product will exhibit amino acid 
sequence identities of between 92% and 100% among different strains of H. pylori 
selected from human patients. The nucleotide sequences encoding the related 

15 polypeptides of this invention are also very similar to one another. For example, 

nucleotide probes derived from the coding sequence of a polypeptide of this invention 
can be used in PCR or hybridization experiments to identify clones carrying the 
nucleotide sequence encoding the homologous related polypeptide. 



WO 97/37044 



PCT/US97/05223 



- 54 - 



SO 



%ID 


00001 


OO'OOl 


OO'OOl 


© 
© 


sd 

ON 


94.35 


o 
© 
©' 
© 


© 
© 

© 


© 
© 
© 
© 


0000! 


| 


rM 

sO 
On" 

ON 


SO 

to 
DO 

On 


0000 1 


00*001 


rM 

DO 
On 
On 


ON 
SO 

sb 

ON 


DO 

00 
On 


99.47 




00 001 


OO'OOl 


00 001 


00 001 


00 001 


00 001 


96.76 


r- 

m 

On 
On 


97.28 | 


Overlap 


o 


rM 
o 


rs 


NO 
rN 


10 
rM 


r- 
r- 


m 
m 
*o 


rN 
m 


m 
rr 


— 

o 

CM 


rM 
rN 


— 

SO 

rM 


OO 
O 

CM 


NO 

cn 


— 

rM 


OO 

NO 
lO 


m 
so 
m 


SO 
OO 


OO 
OO 


r- 
r- 
m 


o 
o 


rN 


CM 
m 


CN 
m 


to 

OO 


»o 

CM 


OO 

r— 

CM 


On 
to 




Leng. 
Diff. 


o 
o 


rM 

tO 


1033 


oo 

NO 


r- 
m 


On 
cn 


to 
no 
rN 


i 


oo 
rM 


CM 
CO 

m 


m 




m 

On 


O 

ON 


NO 


CM 


ON 


NO 

oo 
m 


to 

NO 




NO 
NO 


o 


NO 

to 


m 

On 


to 
m 


NO 


OO 

m 


NO 
OO 




Leng. 


ON 

T 
CM 


NO 


1304 


1484 


o 
o 
m 


NO 

rM 

NO 


oo 

On 

r- 


cn 
cn 


to 
O 


rn 

to 
to 


rn 
oo 


CM 

r- 

CM 


O 

rr 


On 

to 


m 


CM 
— — 


to 
e'- 
en 


r- 
oo 


m 
to 

CM 


m 
oo 
rr 


r- 

NO 

to 


oo 

m 


oo 

NO 

m 


O 
CM 
rr 


m 
CM 
CM 


CM 
CM 
TJ- 


NO 

oo 

NO 


NO 

m 


to 

ON 
CM 


cr ^ 
ea 


m 
o 


OO 

cn 
o 


ON 

rn 

o 


o 
o 


TT 

O 


rM 
O 


m 
o 


O 


to 
O 


NO 

o 


o 


OO 

TT 
O 


ON 

0 


O 

NO 

O 


NO 

O 


CN 

no 
o 


m 
no 
o 


NO 

o 


to 
to 
o 


NO 

to 
o 


*o 

o 


oo 
to 

o 


ON 

to 
o 


o 

NO 

o 


NO 

o 


CM 
NO 

o 


m 

NO 

o 


3 

O 


to 

NO 

o 


ORF Name 


I lael 1922 12586675 f2 1 


02gel0116_23866562_c3_146 


07ee 11402 2458267 c3 108 


07ee50709 10213593 0 77 


O 

^1 

CJ 

rJ 
r- 
o 

ON 
NO 

~l 

to 

NO 

o 

o_ 
1> 

o 


| 09cp 1071 3_23452_c3_l 95 


^5 
°\ 

CM 
O 
CM 

r- 
rM 
rM 

o 

•2. 

ea 


01ce61016 23609580 c3 139 


06gp7l906J024l26_clJ28 


!j 

°i 
oo 

§ 

o 

— 1 
o 
to 
o 

CM 
OD 

NO 

O 


09cpl07l3_34024l87_flJI 


CM 

~i 

cn 
m 

On 
NO 

On 
lO 

m 
«o 

OO 
CM 

NO 

a. 

3. 

JC 


CM 

cn 

CM 

°, 

CM 
O 

NO 
On 

OO 

o 1 

NO 
O 

CO 

m 


m 

Ci| 

CM 
On 

to 

O 

NO 
O 

<L> 
CO 

m 


c 

oo 1 
to 

NO 
CM 

CM 

On 

"EL 
CO 
NO 
O 


I4gpl 1423_2680380l_f3_7 


On 

I 

Ci 

to 
r- 

NO 
NO 
OO 
NO 
CM 

On 
O 
NO 

o 

"5. 

co 

NO 


5 , 

C! 

NO 
ON 

OO 

OO 
"T 

On 

o 
o 

to 
i> 
u 

r- 


02apllil7_23495l87_c3_81 


09cpl07I3J4024187Jl_3I 


o 
oo 

1 

r- 
to 

NO 

to 

to 
m 

1 

On 
O 
r- 
O 
*o 

0> 
V 

r- 
O 


O 
CM 

m 
o 

m 

NO 

r— 

On 

CM 
CM 
O 
m 
a> 

CO 

to 
o 


On 

a 

i 

r- 

CM 

m 
^ ( 

oo 

V 
rn 

o~ 
J= 


NO 

C! 

I 

oo 

m 
to 
o 

oo 

CM 
rn 
O 

Q. 
CO 

rM 


ON 

rr 

I 

rM 

t_> 

1 

OO 

NO 
ON 

to 

OO 

T* 

m 

CM 

CN 
NO 

o 

OO 

o. 

-C 


NO 
CM 

1 

r- 

CN 

oo 

ON 

NO 

NO 
CM 

On 

o» 

r- 

Q. 
-C 


oo 

0 s 

rn 

o 

1 

m 
oo 

ON 

o 

CN 
NO 
CM 

m 
m 
*o 

o> 

3. 


1 

CM 
rr 

OO 

to 

NO 

o 

Cl 
0> 

NO 

O 


06ep30223_34409437_f2_64 


Leng. 


O 
*T 


o 

fN 


rM 


co 
O 
oo 


m 

NO 

rN 


r- 
r— 


cn 
cn 
to 


r- 
to 
m 


r- 
rr 


© 

CM 


cm 

CM 


NO 
CM 


00 
© 

CM 


On 
NO 

en 


CM 


o 
o 

NO 


NO 
NO 

co 


o 


OO 
OO 


CM 
OO 

m 


O 


r- 

CM 


CM 

m 


CM 

m 


oo 
oo 


oo 

to 

CM 


oo 
o 

m 


ON 

to 


oo 


aa Seq 


NO 


OO 

to 


rM 

NO 


3 


r- 

NO 


rN 


no 


oo 


o 
oo 


OO 


NO 

ON 


ON 


On 
O 
OO 


O 
00 


to 

OO 


r- 
oo 


CM 
CM 
OO 


NO 
CM 
OO 


On 
CM 
OO 


m 
m 

OO 


NO 

m 
oo 


r-* 
m 
oo 


oo 
m 
oo 


NO 

oo 


oo 
to 
oo 


to 
oo 


to 

NO 
OO 


CM 

oo 


to 
r- 
oo 


ORFName 


01ael200l 24218781 (2 18 


02eo20706 23866562 c2 53 


rs 

e 

rM 

m 
oo 

ON 

l N 

to 
o 

- "a 

) CO 

r- 
o 


07apl 1213 35401528 cl 21 


05eel08l6 14649077 0 18 


06cp30603 23452 c3 80 


07ap20216 7227202 f3 10 


09cp 11003 19532625 c3 17 


to 
rM 

CJ 

NO 
CM 

CM 
O 

cn 

NO 

O 

m 

S 

CM 


I3ael07l2 14100018 f2 12 


hp6e20339 34024187 cl 37 


01 eel 0203 35281542 c3 16 


O 

m 
10 

NO 

00 
0 

CM 

rM 

NO 

OO 
O 

0> 

si 

■<T 

0 


04eel0816 33726080 c2 29 


06ap 11119 24426508 0 26 


07cel 1019 22051291 fl 1 


14apl022l 13689381 c3 4 


so 

CJ 

en 
to 
rM 
to 
m 

CN 

m 
O 
m 
. eu 
ox 

O 

rM 


hp2p 10272 22692325 0 21 


hp6e20339 24317062 c3 57 


07apl 1213 35156577 cl 24 


07ap80601 976413 0 9 j 


Ci 

to 
r— 
o 

ON 

OO 

o 
oo 
o 

cx 
m 


I2ap 10324 4805318 12 3 


03ae 10804 23485968 c3 47 


05cel0910 25598277 0 3 


06cpll2l7 19720300 f3 II 


NO 

c 

CM 

OO 
ON 

to 

NO 

o 
. a 

NO 

o 


On 

I 

m 

ON 

o 

ta- 
rn 

1 1 

ro 
CM 

CN 

o 

m 
<u 

NO 

o 



BNSDOCID: <WO 9737044A1J_> 



BNS page 56 



WO 97/37044 



4» 



• 



PCT7US97/05223 



- 55 - 



3 



© 

SO 
GO 

2 



3 







ON 




m 


^ 1 




°°l 




c 


o 


°. 




t 




rJ 




VO 


oo 




o 


o 




*o 






r- 






rr 










"3" 


1 


o' 


v~% 


ro 






rsi 


o 


oo 


rs» 




o 


o 


m 




m 




o. 


O. 


CQ 


V 


<l» 






v© 


o 


o 


o 



BNS page 57 



WO 97/37044 




PCTAJS97/0S223 



-56- 

IV. Identification of Nucleic Acids Encoding Vaccine Components and Targets for 
Agents Effective Against H. pylori 

The disclosed K pylori genome sequence includes segments that direct the 

synthesis of ribonucleic acids and polypeptides, as well as origins of replication, 

5 promoters, other types of regulatory sequences, and intergenic nucleic acids. The 

invention encompasses the identification of nucleic acids encoding immunogenic 

components of vaccines and targets for agents effective against H. pylori. An important 

aspect of this identification is to determine the function of the disclosed sequences, 

which can be achieved using a variety of approaches. Non-limiting examples of these 

10 methods are described briefly below. 

Homology to known sequences: Computer-assisted comparison of the disclosed 
H. pylori sequences with previously reported sequences present in publicly available 
databases is a useful tool for identifying functional H. pylori nucleic acid and 
polypeptide sequences. It will be understood that protein-coding sequences, for 

1 5 example, may be compared as a whole, and that a high degree of sequence homology 
between two proteins (such as, for example, >80-90%) at the amino acid level is 
strongly suggestive that the two proteins also possess some degree of functional 
homology, such as, for example, among enzymes involved in metabolism, DNA 
synthesis, or cell wall synthesis, and proteins involved in transport, cell division, etc. In 

20 addition, many structural features of particular protein classes have been identified and 
correlate with specific consensus sequences, such as, for example, binding domains for 
nucleotides, DNA, metal ions, and other small molecules; sites for covalent 
modifications such as phosphorylation, acylation, and the like; sites of protein:protein 
interactions, etc. These consensus sequences may be quite short and thus may represent 

25 only a fraction of the entire protein-coding sequence. Identification of such a feature in 
an H. pylori sequence is therefore useful in determining the function of the encoded 
protein and identifying potentially useful targets of antibacterial drugs. 

Of particular relevance to the present invention are structural features that are 
common to secretory, transmembrane, and surface proteins, including secretion signal 

30 peptides and hydrophobic transmembrane domains. K pylori proteins identified as 
containing putative signal sequences and/or transmembrane domains are useful as 
immunogenic components of vaccines. 

Identification of essential genes: Nucleic acids that encode proteins essential for 
growth or viability of H. pylori are preferred drug targets. H. pylori genes can be tested 

35 for their biological relevance to the organism by examining the effect of deleting and/or 
disrupting the genes, i.e., by so-called gene "knockout", using techniques known to 
those skilled in the relevant art. In this manner, essential genes may be identified. 
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S train- specific sequences: Because of the evolutionary relationship between 
different K pylori strains, it is believed that the presently disclosed H. pylori sequences 
are useful for identifying, and/or discriminating between, previously known and new //. 
pylori strains. It is believed that other H. pylori strains will exhibit at least 70% 

5 sequence homology with the presently disclosed sequence, although whether or not this 
is correct is not essential to the invention. Systematic and routine analyses of DNA 
sequences derived from samples containing H. pylori strains, and comparison with the 
present sequence allows for the identification of sequences that can be used to 
discriminate between strains, as well as those that are common to all H. pylori strains. 

1 0 In one embodiment, the invention provides nucleic acids, including probes, and peptide 
and polypeptide sequences that discriminate between different strains of H. pylori. 
Strain-specific components can also be identified functionally by their ability to elicit or 
react with antibodies that selectively recognize one or more H. pylori strains. 

In another embodiment, the invention provides nucleic acids, including probes, 

1 5 and peptide and polypeptide sequences that are common to all K pylori strains but are 
not found in other bacterial species. 

Specific Example: Determination Of Candidate Protein Antigens For Antibody And 
Vaccine Development 

20 The selection of candidate protein antigens for vaccine development can be 

derived from the nucleic acids encoding H. pylori polypeptides. First, the ORF's can be 
analyzed for homology to other known exported or membrane proteins and analyzed 
using the discriminant analysis described by Klein, et al. (Klein, P., Kanehsia, M., and 
DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported 

25 and membrane proteins. 

Homology searches can be performed using the BLAST algorithm contained in 
the Wisconsin Sequence Analysis Package (Genetics Computer Group, University 
Research Park, 575 Science Drive, Madison, WI 5371 1) to compare each predicted ORF 
amino acid sequence with all sequences found in the current GenBank, SWISS-PROT 

30 and PIR databases. BLAST searches for local alignments between the ORF and the 
databank sequences and reports a probability score which indicates the probability of 
finding this sequence by chance in the database. ORF's with significant homology (e.g. 
probabilities better than 1x10 (ee-6)) to membrane or exported proteins represent likely 
protein antigens for vaccine development. Possible functions can be provided to H. 

35 pylori genes based on sequence homology to genes cloned in other organisms. 

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF 
amino acid sequences. This algorithm uses the intrinsic information contained in the 
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ORF amino acid sequence and compares it to information derived from the properties of 
known membrane and exported proteins. This comparison predicts which proteins will 
be exported, membrane associated or cytoplasmic. ORF amino acid sequences 
identified as exported or membrane associated by this algorithm are likely protein 
5 antigens for vaccine development. 

Surface exposed outer membrane proteins are likely to represent the best 
antigens to provide a protective immune response against H. pylori. Among the 
algorithms that can be used to aid in prediction of these outer membrane proteins include 
the presence of an amphipathic beta-sheet region at their C-terminus. This region which 
1 0 has been detected in a large number of outer membrane proteins in Gram negative 

bacteria is often characterized by hydrophobic residues (Phe or Tyr) approximately at 
positions 1, 3, 5, 7 and 9 from the C-terminus (e.g., see Figure 8, block F). Importantly, 
these sequences have not been detected at the C -termini of periplasmic proteins, thus 
allowing preliminary distinction between these classes of proteins based on primary 
15 sequence data. This phenomenon has been reported previously by Struyve et al. (J. MoL 
Biol. 218:141-148, 1991). 

Also illustrated in Figure 8 are additional amino acid sequence motifs found in 
many outer membrane proteins of//, pylori. The amino acid sequence alignment in 
Figure 8 depicts portions of the sequence of 12 K pylori proteins (depicted in the single 
20 letter amino acid code) labeled with their amino acid Sequence ID Numbers and shown 
N-terminal to C-terminal, left to right. Six distinct blocks (labeled A through F) of 
similar amino acid residues are found including the distinctive hydrophobic residues 
(Phe or Tyr; F or Y according to the single letter code for amino acid residues) 
frequently found at positions near the C-terminus of outer membrane proteins. The 
25 presence of several shared motifs clearly establishes the similarity between members of 
this group of proteins. 

In addition, outer membrane proteins isolated from H. pylori frequently share a 
motif near the mature N-terminus (i.e., after processing to remove the secretion signal) 
as illustrated in the blocked amino acid residues in Figure 9. Figure 9 depicts the N- 
30 terminal portion of nine H. pylori proteins (designated by their amino acid Sequence ID 
Numbers and shown N-terminal to C-terminal, left to right). 

One skilled in the art would know that these shared sequence motifs are highly 
significant and establish a similarity among this group of proteins. 

Infrequently it is not possible to distinguish between multiple possible 
35 nucleotides at a given position in the nucleic acid sequence. In those cases the 
ambiguities are denoted by an extended alphabet as follows: 
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These are the official IUPAC-IUB single-letter base codes 



Code 


Base Description 




G 


Guanine 




A 


Adenine 




T 


Thymine 




C 


Cytosine 




R 


Purine 


(A or G) 


Y 


Pyrimidine 


(C or T or U) 


M 


Amino 


(A or C) 


K 


Ketone 


(G or T) 


S 


Strong interaction 


(C or G) 


W 


Weak interaction 


(A or T) 


H 


Not-G 


(A or C or T) 


B 


Not-A 


(C or G or T) 


V 


Not-T (not-U) 


(A or C or G) 


D 


Not-C 


(A or G or T) 


N 


Any 


(A or C or G or T) 



The amino acid translations of this invention account for the ambiguity in the 
5 nucleic acid sequence by translating the ambiguous codon as the letter M X n . In all cases, 
the permissible amino acid residues at a position are clear from an examination of the 
nucleic acid sequence based on the standard genetic code. 

V. Production of Fragments and Analogs of H. pylori Nucleic Acids and Polypeptides 

10 Based on the discovery of the H. pylori gene products provided in the Sequence 

Listing, one skilled in the art can alter the disclosed structure (of//, pylori genes), e.g., 
by producing fragments or analogs, and test the newly produced structures for activity. 
Examples of techniques known to those skilled in the relevant art which allow the 
production and testing of fragments and analogs are discussed below. These, or 

1 5 analogous methods can be used to make and screen libraries of polypeptides, e.g., 

libraries of random peptides or libraries of fragments or analogs of cellular proteins for 
the ability to bind H. pylori polypeptides. Such screens are useful for discovery of 
inhibitors of K pylori. 

Generation of Fragments 

20 Fragments of a protein can be produced in several ways, e.g., recombinantly, by 

proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which 
encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide 

25 fragments. Digestion with "end-nibbling" endonucleases can thus generate DNA's 

which encode an array of fragments. DNA's which encode fragments of a protein can 
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also be generated by random shearing, restriction digestion or a combination of the 
above-discussed methods. 

Fragments can also be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
5 peptides of the present invention may be arbitrarily divided into fragments of desired 
length with no overlap of the fragments, or divided into overlapping fragments of a 
desired length. 

Alteration of Nucleic Acids and Polypeptides: Random Methods 
Amino acid sequence variants of a protein can be prepared by random 
10 mutagenesis of DNA which encodes a protein or a particular domain or region of a 
protein. Useful methods include PCR mutagenesis and saturation mutagenesis. A 
library of random amino acid sequence variants can also be generated by the synthesis of 
a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a 
library of variants are elsewhere herein). 
15 (A) PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to 
introduce random mutations into a cloned fragment of DNA (Leung et al., 1989, 
Technique 1:11-15). The DNA region to be mutagenized is amplified using the 
polymerase chain reaction (PCR) under conditions that reduce the fidelity of DNA 
20 synthesis by Taq DNA polymerase, e.g., by using a dGTP/dATP ratio of five and adding 
Mn 2+ to the PCR reaction. The pool of amplified DNA fragments are inserted into 
appropriate cloning vectors to provide random mutant libraries. 
(B^ Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large 
25 number of single base substitutions into cloned DNA fragments (Mayers et al., 1985, 
Science 229:242). This technique includes generation of mutations, e.g., by chemical 
treatment or irradiation of single-stranded DNA in vitro, and synthesis of a 
complimentary DNA strand. The mutation frequency can be modulated by modulating 
the severity of the treatment, and essentially all possible base substitutions can be 
30 obtained. Because this procedure does not involve a genetic selection for mutant 

fragments both neutral substitutions, as well as those that alter function, are obtained. 
The distribution of point mutations is not biased toward conserved sequence elements. 
(C) Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate 
35 oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried 
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an 
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appropriate expression vector. The synthesis of degenerate oligonucleotides is known in 
the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) 
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, 
Amsterdam: Elsevier pp273-289; Itakura et al. (\9&4)Annu. Rev. Biochem. 53:323; 
5 Itakura etal. (1984) Science 198:1056; Ikeet al. (1983) Nucleic Acid Res. 11:477. Such 
techniques have been employed in the directed evolution of other proteins (see, for 
example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429- 
2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378- 
6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

10 Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis 

Non-random or directed, mutagenesis techniques can be used to provide specific 
sequences or mutations in specific regions. These techniques can be used to create 
variants which include, e.g., deletions, insertions, or substitutions, of residues of the 
known amino acid sequence of a protein. The sites for mutation can be modified 

15 individually or in series, e.g., by (1) substituting first with conserved amino acids and 
then with more radical choices depending upon results achieved, (2) deleting the target 
residue, or (3) inserting residues of the same or a different class adjacent to the located 
site, or combinations of options 1-3. 

(A) Alanine Scanning Mutagenesis 

20 Alanine scanning mutagenesis is a useful method for identification of 

certain residues or regions of the desired protein that are preferred locations or domains 
for mutagenesis, Cunningham and Wells (Science 244: 1081-1085, 1989). In alanine 
scanning, a residue or group of target residues are identified (e.g., charged residues such 
as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino 

25 acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect 
the interaction of the amino acids with the surrounding aqueous environment in or 
outside the cell. Those domains demonstrating functional sensitivity to the substitutions 
are then refined by introducing further or other variants at or for the sites of substitution. 
Thus, while the site for introducing an amino acid sequence variation is predetermined, 

30 the nature of the mutation per se need not be predetermined. For example, to optimize 
the performance of a mutation at a given site, alanine scanning or random mutagenesis 
may be conducted at the target codon or region and the expressed desired protein subunit 
variants are screened for the optimal combination of desired activity. 

(B) Oligonucleotide-Mediated Mutagenesis 

35 Oligonucleotide-mediated mutagenesis is a useful method for preparing 

substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 
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2: 1 83, 1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide 
encoding a mutation to a DNA template, where the template is the single-stranded form 
of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the 
desired protein. After hybridization, a DNA polymerase is used to synthesize an entire 
5 second complementary strand of the template that will thus incorporate the 

oligonucleotide primer, and will code for the selected alteration in the desired protein 
DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An 
optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
10 mutation. This ensures that the oligonucleotide will hybridize properly to the single- 
stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques known in the art such as that described by Crea et al. (Proc. Natl. Acad. Sci. 
USA, 75:5765[1978]). 

(C) Cassette Mutagenesis 

1 5 Another method for preparing variants, cassette mutagenesis, is based on 

the technique described by Wells et al. (Gene, 34:3 15[1 985]). The starting material is a 
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The 
codon(s) in the protein subunit DNA to be mutated are identified. There must be a 
unique restriction endonuclease site on each side of the identified mutation site(s). If no 

20 such restriction sites exist, they may be generated using the above-described 

oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations 
in the desired protein subunit DNA. After the restriction sites have been introduced into 
the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded 
oligonucleotide encoding the sequence of the DNA between the restriction sites but 

25 containing the desired mutation(s) is synthesized using standard procedures. The two 
strands are synthesized separately and then hybridized together using standard 
techniques. This double-stranded oligonucleotide is referred to as the cassette. This 
cassette is designed to have 3' and 5' ends that are comparable with the ends of the 
linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now 

30 contains the mutated desired protein subunit DNA sequence. 

(D) Combinatorial Mutagenesis 

Combinatorial mutagenesis can also be used to generate mutants (Ladner 
et al., WO 88/06630). In this method, the amino acid sequences for a group of homologs 
or other related proteins are aligned, preferably to promote the highest homology 
35 possible. All of the amino acids which appear at a given position of the aligned 

sequences can be selected to create a degenerate set of combinatorial sequences. The 
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variegated library of variants is generated by combinatorial mutagenesis at the nucleic 
acid level, and is encoded by a variegated gene library. For example, a mixture of 
synthetic oligonucleotides can be enzymatically ligated into gene sequences such that 
the degenerate set of potential sequences are expressible as individual peptides, or 
5 alternatively, as a set of larger fusion proteins containing the set of degenerate 
sequences. 

Other Modifications of H. pylori Nucleic Acids and Polypeptides 
It is possible to modify the structure of an H. pylori polypeptide for such 
purposes as increasing solubility, enhancing stability (e.g., shelf life ex vivo and 

10 resistance to proteolytic degradation in vivo). A modified H. pylori protein or peptide 
can be produced in which the amino acid sequence has been altered, such as by amino 
acid substitution, deletion, or addition as described herein. 

An H. pylori peptide can also be modified by substitution of cysteine residues 
preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize 

1 5 dimerization via disulfide linkages. In addition, amino acid side chains of fragments of 
the protein of the invention can be chemically modified. Another modification is 
cyclization of the peptide. 

In order to enhance stability and/or reactivity, an H. pylori polypeptide can be 
modified to incorporate one or more polymorphisms in the amino acid sequence of the 

20 protein resulting from any natural allelic variation. Additionally, D-amino acids, non- 
natural amino acids, or non-amino acid analogs can be substituted or added to produce a 
modified protein within the scope of this invention. Furthermore, an H. pylori 
polypeptide can be modified using polyethylene glycol (PEG) according to the method 
of A. Sehon and co-workers (Wie et al., supra) to produce a protein conjugated with 

25 PEG. In addition, PEG can be added during chemical synthesis of the protein. Other 
modifications of H. pylori proteins include reduction/alky lation (Tarr, Methods of 
Protein Microcharacterization, J. E. Silver ed., Humana Press, Clifton NJ 155-194 
(1986)); acylation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and 
Shiigi, eds, Selected Methods in Cellular Immunology, WH Freeman, San Francisco, CA 

30 (1980), U.S. Patent 4,939,239; or mild formalin treatment (Marsh, (1971) Int. Arch, of 
Allergy and Appl Immunol, 41_: 199-215). 

To facilitate purification and potentially increase solubility of an H. pylori 
protein or peptide, it is possible to add an amino acid fusion moiety to the peptide 
backbone. For example, hexa-histidine can be added to the protein for purification by 

35 immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988) 

Bio/Technology, 6: 1321 - 1325). In addition, to facilitate isolation of peptides free of 
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irrelevant sequences, specific endoprotease cleavage sites can be introduced between the 
sequences of the fusion moiety and the peptide. 

To potentially aid proper antigen processing of epitopes within an H. pylori 
polypeptide, canonical protease sensitive sites can be engineered between regions, each 
5 comprising at least one epitope via recombinant or synthetic methods. For example, 
charged amino acid pairs, such as KK or RR, can be introduced between regions within 
a protein or fragment during recombinant construction thereof. The resulting peptide 
can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes 
which would generate portions of the protein containing one or more epitopes. In 
10 addition, such charged amino acid residues can result in an increase in the solubility of 
the peptide. 

Primary Methods for Screening Polypeptides and Analogs 
Various techniques are known in the art for screening generated mutant gene 
products. Techniques for screening large gene libraries often include cloning the gene 

15 library into replicable expression vectors, transforming appropriate cells with the 
resulting library of vectors, and expressing the genes under conditions in which 
detection of a desired activity, e.g., in this case, binding to H. pylori polypeptide or an 
interacting protein, facilitates relatively easy isolation of the vector encoding the gene 
whose product was detected. Each of the techniques described below is amenable to 

20 high through-put analysis for screening large numbers of sequences created, e.g., by 
random mutagenesis techniques. 

(A) Two Hybrid Systems 

Two hybrid assays such as the system described above (as with the other 
screening methods described herein), can be used to identify polypeptides, e.g., 

25 fragments or analogs of a naturally-occurring K pylori polypeptide, e.g., of cellular 
proteins, or of randomly generated polypeptides which bind to an H. pylori protein. 
(The H. pylori domain is used as the bait protein and the library of variants are expressed 
as fish fusion proteins.) In an analogous fashion, a two hybrid assay (as with the other 
screening methods described herein), can be used to find polypeptides which bind a H. 

30 pylori polypeptide. 

(B) Display Libraries 

In one approach to screening assays, the candidate peptides are displayed 
on the surface of a cell or viral particle, and the ability of particular cells or viral 
particles to bind an appropriate receptor protein via the displayed product is detected in a 
35 "panning assay". For example, the gene library can be cloned into the gene for a surface 
membrane protein of a bacterial cell, and the resulting fusion protein detected by 
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panning (Ladner et al., WO 88/06630; Fuchs et al. (1991 ) Bio/Technology 9: 1 370-1 371 ; 
and Goward et al. (1992) 7755 1 8: 136-140). In a similar fashion, a detectably labeled 
ligand can be used to score for potentially functional peptide homologs. Fluorescently 
labeled ligands, e.g., receptors, can be used to detect homolog which retain ligand- 
5 binding activity. The use of fluorescently labeled ligands, allows cells to be visually 
inspected and separated under a fluorescence microscope, or, where the morphology of 
the cell permits, to be separated by a fluorescence-activated cell sorter. 

A gene library can be expressed as a fusion protein on the surface of a viral 
particle. For instance, in the filamentous phage system, foreign peptide sequences can 

10 be expressed on the surface of infectious phage, thereby conferring two significant 
benefits. First, since these phage can be applied to affinity matrices at concentrations 
well over 10*3 phage per milliliter, a large number of phage can be screened at one time. 
Second, since each infectious phage displays a gene product on its surface, if a particular 
phage is recovered from an affinity matrix in low yield, the phage can be amplified by 

15 another round of infection. The group of almost identical E. coli filamentous phages 
Ml 3, fd M and fl are most often used in phage display libraries. Either of the phage gill 
or gVIII coat proteins can be used to generate fusion proteins without disrupting the 
ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2- 
terminal end of pill and phage bearing such epitopes recovered from a large excess of 

20 phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; Garrard et al., 
PCT publication WO 92/09690; Marks et al. (1992) J. Biol Chem, 267:16007-16010; 
Griffiths et al. (1993) EMBOJ 12:725-734; Clackson et al. (1991) Nature 352:624-628; 
and Barbas et al. (1992) PNAS 89:4457-4461). 

A common approach uses the maltose receptor of E. coli (the outer membrane 

25 protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037). 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
peptides fused into one of the extracellular loops of the protein. These peptides are 
available for binding to ligands, e.g., to antibodies, and can elicit an immune response 
when the cells are administered to animals. Other cell surface proteins, e.g., OmpA 

30 (Schorr et al. (1991 ) Vaccines 91, PP- 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 
37-45), and PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial 
surface structures have served as vehicles for peptide display. Peptides can be fused to 
pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange 
of genetic information (Thiry et al. (\9%9)AppL Environ. Microbiol 55, 984-993). 

35 Because of its role in interacting with other cells, the pilus provides a useful support for 
the presentation of peptides to the extracellular environment. Another large surface 
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structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of 
peptides to the subunit protein flagellin offers a dense array of may peptides copies on 
the host cells (Kuwajima et ah (1988) Bio/Tech. 6, 1080-1083). Surface proteins of 
other bacterial species have also served as peptide fusion partners. Examples include the 
5 Staphylococcus protein A and the outer membrane protease IgA of Neisseria (Hansson 
et al. ( 1 992) J. Bacteriol 1 74, 4239-4245 and Klauser et al. ( 1 990) EMBO J. 9, 1 99 1 - 
1999). 

In the filamentous phage systems and the LamB system described above, the 
physical link between the peptide and its encoding DN A occurs by the containment of 
1 0 the DN A within a particle (cell or phage) that carries the peptide on its surface. 

Capturing the peptide captures the particle and the DNA within. An alternative scheme 
uses the DNA-binding protein LacI to form a link between peptide and DNA (Cull et al 
(1992) PNAS USA 89:1865-1869). This system uses aplasmid containing the LacI gene 
with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by 
1 5 arabinose, a LacI -peptide fusion protein is produced. This fusion retains the natural 
ability of LacI to bind to a short DNA sequence known as LacO operator (LacO). By 
installing two copies of LacO on the expression plasmid, the Lacl-peptide fusion binds 
tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a 
single oligonucleotide sequence and each cell expresses only a single peptide sequence, 
20 the peptides become specifically and stably associated with the DNA sequence that 
directed its synthesis. The cells of the library are gently lysed and the peptide-DNA 
complexes are exposed to a matrix of immobilized receptor to recover the complexes 
containing active peptides. The associated plasmid DNA is then reintroduced into cells 
for amplification and DNA sequencing to determine the identity of the peptide ligands. 
25 As a demonstration of the practical utility of the method, a large random library of 
dodecapeptides was made and selected on a monoclonal antibody raised against the 
opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a 
consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. 
(1992) /Voc. Natl. Acad Sci. U.S.A. 89-1869) 
30 This scheme, sometimes referred to as peptides-on-plasmids, differs in two 

important ways from the phage display methods. First, the peptides are attached to the 
C-terminus of the fusion protein, resulting in the display of the library members as 
peptides having free carboxy termini. Both of the filamentous phage coat proteins, pill 
and pVIII, are anchored to the phage through their C-termini, and the guest peptides are 
35 placed into the outward-extending N-terminal domains. In some designs, the phage- 
displayed peptides are presented right at the amino terminus of the fusion protein. 
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(Cwirla, et al. (1990) Proc. Natl. Acad. Set U.S.A. 87, 6378-6382) A second difference 
is the set of biological biases affecting the population of peptides actually present in the 
libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells. 
The phage coat fusions are exposed briefly to the cytoplasm during translation but are 
5 rapidly secreted through the inner membrane into the periplasm ic compartment, 

remaining anchored in the membrane by their C-terminal hydrophobic domains, with the 
N-termini, containing the peptides, protruding into the periplasm while awaiting 
assembly into phage particles. The peptides in the LacI and phage libraries may differ 
significantly as a result of their exposure to different proteolytic activities. The phage 
1 0 coat proteins require transport across the inner membrane and signal peptidase 

processing as a prelude to incorporation into phage. Certain peptides exert a deleterious 
effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J. 
Med. Chem. 37(9): 1233-125 1). These particular biases are not a factor in the LacI 
display system. 

1 5 The number of small peptides available in recombinant random libraries is 

enormous. Libraries of 10?- 10^ independent clones are routinely prepared. Libraries as 
large as 10* * recombinants have been created, but this size approaches the practical 
limit for clone libraries. This limitation in library size occurs at the step of transforming 
the DNA containing randomized segments into the host bacterial cells. To circumvent 

20 this limitation, an in vitro system based on the display of nascent peptides in polysome 
complexes has recently been developed. This display library method has the potential of 
producing libraries 3-6 orders of magnitude larger than the currently available 
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, 
expression of the peptides, and screening, is done in an entirely cell-free format. 

25 In one application of this method (Gallop et al. (1994) J. Med. Chem. 

37(9): 1233- 1251), a molecular DNA library encoding 10 12 decapeptides was 
constructed and the library expressed in an E. coli S30 in vitro coupled 
transcription/translation system. Conditions were chosen to stall the ribosomes on the 
mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes 

30 and yielding complexes containing nascent peptides still linked to their encoding RNA. 
The polysomes are sufficiently robust to be affinity purified on immobilized receptors in 
much the same way as the more conventional recombinant peptide display libraries are 
screened. RNA from the bound complexes is recovered, converted to cDNA, and 
amplified by PCR to produce a template for the next round of synthesis and screening. 

35 The polysome display method can be coupled to the phage display system. Following 
several rounds of screening, cDNA from the enriched pool of polysomes was cloned into 
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a phagemid vector. This vector serves as both a peptide expression vector, displaying 
peptides fused to the coat proteins, and as a DN A sequencing vector for peptide 
identification. By expressing the poly some-derived peptides on phage, one can either 
continue the affinity selection procedure in this format or assay the peptides on 
5 individual clones for binding activity in a phage ELISA, or for binding specificity in a 
completion phage ELISA (Barret, et al. (1992) Anal Biochem 204,357-364). To 
identify the sequences of the active peptides one sequences the DNA produced by the 
phagemid host. 

Secondary Screening of Polypeptides and Analogs 
10 The high through-put assays described above can be followed by secondary 

screens in order to identify further biological activities which will, e.g., allow one skilled 
in the art to differentiate agonists from antagonists. The type of a secondary screen used 
will depend on the desired activity that needs to be tested. For example, an assay can be 
developed in which the ability to inhibit an interaction between a protein of interest and 
1 5 its respective ligand can be used to identify antagonists from a group of peptide 
fragments isolated though one of the primary screens described above. 

Therefore, methods for generating fragments and analogs and testing them for 
activity are known in the art. Once the core sequence of interest is identified, it is 
routine to perform for one skilled in the art to obtain analogs and fragments. 
20 Peptide Mimetics of H. pylori Polypeptides 

The invention also provides for reduction of the protein binding domains of the 
subject K pylori polypeptides to generate mimetics, e.g. peptide or non-peptide agents. 
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, 
e.g., in the case of an K pylori polypeptide binding to a naturally occurring ligand. The 
25 critical residues of a subject H. pylori polypeptide which are involved in molecular 

recognition of a polypeptide can be determined and used to generate K #y/on-derived 
peptidomimetics which competitively or noncompetitively inhibit binding of the K 
pylori polypeptide with an interacting polypeptide (see, for example, European patent 
applications EP-412,762A and EP-B31,080A). 
30 For example, scanning mutagenesis can be used to map the amino acid residues 

of a particular K pylori polypeptide involved in binding an interacting polypeptide, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to an interacting polypeptide, and which 
therefore can inhibit binding of an H. pylori polypeptide to an interacting polypeptide 
35 and thereby interfere with the function of//, pylori polypeptide. For instance, non- 

hydrolyzable peptide analogs of such residues can be generated using benzodiazepine 
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(e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in 
Peptides: Chemistry and Biology ; G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry 
5 and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto- 
methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et 
al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide 
Symposium) Pierce Chemical Co. Rockland, IL, 1985), (i-turn dipeptide cores (Nagai et 
al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Per kin Trans 
10 1 : 123 1), and p-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 

Commun 126:4 19; and Dann et al. (1986) Biochem Biophys Res Commun 134:71). 

VI. Vaccine Formulations for //. pylori Nucleic Acids and Polypeptides 

This invention also features vaccine compositions for protection against infection 

15 by H. pylori or for treatment of H. pylori infection, a gram-negative spiral 

microaerophilic bacterium. In one embodiment, the vaccine compositions contain one 
or more immunogenic components such as a surface protein from H. pylori, or portion 
thereof, and a pharmaceutical^ acceptable carrier. Nucleic acids within the scope of the 
invention are exemplified by the nucleic acids shown in the Sequence Listing which 

20 encode H. pylori surface proteins. However, any nucleic acid encoding an immunogenic 
H. pylori protein, or portion thereof, which is capable of expression in a cell, can be used 
in the present invention. These vaccines have therapeutic and prophylactic utilities. 

One aspect of the invention provides a vaccine composition for protection 
against infection by H. pylori which contains at least one immunogenic fragment of an 

25 H. pylori protein and a pharmaceutical ly acceptable carrier. Preferred fragments include 
peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino 
acid residues in length, and more preferably about 12-16 amino acid residues in length. 

Immunogenic components of the invention can be obtained, for example, by 
screening polypeptides recombinantly produced from the corresponding fragment of the 

30 nucleic acid encoding the full-length H. pylori protein. In addition, fragments can be 
chemically synthesized using techniques known in the art such as conventional 
Merrifield solid phase f-Moc or t-Boc chemistry. 

In one embodiment, immunogenic components are identified by the ability of the 
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for 

35 example, T cell proliferation or cytokine secretion are defined herein as comprising at 
least one T cell epitope. T cell epitopes are believed to be involved in initiation and 
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perpetuation of the immune response to the protein allergen which is responsible for the 
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at 
the level of the T helper cell by binding to an appropriate HLA molecule on the surface 
of an antigen presenting cell, thereby stimulating the T cell subpopulation with the 
5 relevant T cell receptor for the epitope. These events lead to T cell proliferation, 

lymphokine secretion, local inflammatory reactions, recruitment of additional immune 
cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading 
to the production of antibodies. A T cell epitope is the basic element, or smallest unit of 
recognition by a T cell receptor, where the epitope comprises amino acids essential to 
1 0 receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid 
sequences which mimic those of the T cell epitopes are within the scope of this 
invention. 

In another embodiment, immunogenic components of the invention are identified 
through genomic vaccination. The basic protocol is based on the idea that expression 

1 5 libraries consisting of all or parts of a pathogen genome, e.g., an H. pylori genome, can 
confer protection when used to genetically immunize a host. This expression library 
immunization (ELI) is analogous to expression cloning and involves reducing a genomic 
expression library of a pathogen, e.g., H. pylori^ into plasmids that can act as genetic 
vaccines. The plasmids can also be designed to encode genetic adjuvants which can 

20 dramatically stimulate the humoral response. These genetic adjuvants can be introduced 
at remote sites and act as well extracelluraly as intracellularly. 

This is a new approach to vaccine production that has many of the advantages of 
live/attenuated pathogens but no risk of infection. An expression library of pathogen 
DNA is used to immunize a host thereby producing the effects of antigen presentation of 

25 a live vaccine without the risk. For example, in the present invention, random fragments 
from the H. pylori genome or from cosmid or plasmid clones, as well as PCR products 
from genes identified by genomic sequencing, can be used to immunize a host. The 
feasibility of this approach has been demonstrated with Mycoplasma pulmonis (Barry et 
al.. Nature 377:632-635, 1995), where even partial expression libraries of Mycoplasma 

30 pulmonis, a natural pathogen in rodents, provided protection against challenge from the 
pathogen. 

ELI is a technique that allows for production of a non-infectious multipartite 
vaccine, even when little is known about pathogen's biology, because ELI uses the 
immune system to screen candidate genes. Once isolated, these genes can be used as 
35 genetic vaccines or for development of recombinant protein vaccines. Thus. ELI allows 
for production of vaccines in a systematic, largely mechanized fashion. 
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Screening immunogenic components can be accomplished using one or more of 
several different assays. For example, in vitro, peptide T cell stimulatory activity is 
assayed by contacting a peptide known or suspected of being immunogenic with an 
antigen presenting cell which presents appropriate MHC molecules in a T cell culture. 
Presentation of an immunogenic H. pylori peptide in association with appropriate MHC 
molecules to T cells in conjunction with the necessary costimulation has the effect of 
transmitting a signal to the T cell that induces the production of increased levels of 
cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be 
obtained and assayed for interleukin-2 or other known cytokines. For example, any one 
of several conventional assays for interleukin-2 can be employed, such as the assay 
described in Proc. Natl Acad. Sci USA, 86: 1333 (1989) the pertinent portions of which 
are incorporated herein by reference. A kit for an assay for the production of interferon 
is also available from Genzyme Corporation (Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 
thymidine incorporation. The proliferation of T cells can be measured in vitro by 
determining the amount of ^H-labeled thymidine incorporated into the replicating DNA 
of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell 
division can be quantified. 

Vaccine compositions of the invention containing immunogenic components 
(e.g., H. pylori polypeptide or fragment thereof or nucleic acid encoding an H. pylori 
polypeptide or fragment thereof) preferably include a pharmaceutically acceptable 
carrier. The term "pharmaceutically acceptable carrier" refers to a carrier that does not 
cause an allergic reaction or other untoward effect in patients to whom it is administered. 
Suitable pharmaceutically acceptable carriers include, for example, one or more of 
water, saline, phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well 
as combinations thereof. Pharmaceutically acceptable carriers may further comprise 
minor amounts of auxiliary substances such as wetting or emulsifying agents, 
preservatives or buffers, which enhance the shelf life or effectiveness of the antibody. 
For vaccines of the invention containing K pylori polypeptides, the polypeptide is 
coadministered with a suitable adjuvant. 

It will be apparent to those of skill in the art that the therapeutically effective 
amount of DNA or protein of this invention will depend, inter alia, upon the 
administration schedule, the unit dose of antibody administered, whether the protein or 
DNA is administered in combination with other therapeutic agents, the immune status 
and health of the patient, and the therapeutic activity of the particular protein or DNA. 
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Vaccine compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Methods for intramuscular 
immunization are described by Wolff et al. (1990) Science 247 : 1465-1468 and by 
Sedegah et al. (1994) Immunology 91: 9866-9870. Other modes of administration 
5 include oral and pulmonary formulations, suppositories, and transdermal applications. 
Oral immunization is preferred over parenteral methods for inducing protection against 
infection by H. pylori. Czinn et. al. (1993) Vaccine 1 1: 637-642. Oral formulations 
include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 
10 carbonate, and the like. 

The vaccine compositions of the invention can include an adjuvant, including, 
but not limited to aluminum hydroxide; N-acetyl-muramyl— L-threonyl-D-isoglutamine 
(thr-MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as 
nor-MDP); N-acetylmuramy 1-L-alany l-D-isoglutaminyl-L-alanine-2-( 1 '-2'-dipalmitoy 1- 
15 sn-glycero-3-hydroxyphos-phoryloxy)-ethylamine (CGP 19835 A, referred to a MTP- 
PE); RIBI, which contains three components from bacteria; monophosphoryl lipid A; 
trehalose dimycoloate; cell wall skeleton (MPL + TDM + CWS) in a 2% 
squalene/Tween 80 emulsion; and cholera toxin. Others which may be used are non- 
toxic derivatives of cholera toxin, including its B subunit, and/or conjugates or 
20 genetically engineered fusions of the K pylori polypeptide with cholera toxin or its B 
subunit, procholeragenoid, fungal polysaccharides, including schizophyllan, muramyl 
dipeptide, muramyl dipeptide derivatives, phorbol esters, labile toxin of E. coli, non-//. 
pylori bacterial lysates, block polymers or saponins. 

Other suitable delivery methods include biodegradable microcapsules or 
25 immuno-stimulating complexes (ISCOMs) or liposomes, genetically engineered 

attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like 
particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of 
adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably 
used in an amount of 5 |ig to 50 j^g, for example 1 0 jag to 35 ng. When used in the form 
30 of microcapsules, the amount used will depend on the amount employed in the matrix of 
the microcapsule to achieve the desired dosage. The determination of this amount is 
within the skill of a person of ordinary skill in the art. 

Carrier systems in humans may include enteric release capsules protecting the 
antigen from the acidic environment of the stomach, and including K pylori polypeptide 
35 in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the 
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invention are enteric coated capsules and polylactide-glycolide microspheres. Suitable 
diluents are 0.2 N NaHC03 and/or saline. 

Vaccines of the invention can be administered as a primary prophylactic agent in 
adults or in children, as a secondary prevention, after successful eradication of//, pylori 
5 in an infected host, or as a therapeutic agent in the aim to induce an immune response in 
a susceptible host to prevent infection by K pylori. The vaccines of the invention are 
administered in amounts readily determined by persons of ordinary skill in the art. 
Thus, for adults a suitable dosage will be in the range of 10 fig to 10 g, preferably 10 jig 
to 100 mg, for example 50 ng to 50 mg. A suitable dosage for adults will also be in the 

10 range of 5 ^ig to 500 mg. Similar dosage ranges will be applicable for children. Those 
skilled in the art will recognize that the optimal dose may be more or less depending 
upon the patient's body weight, disease, the route of administration, and other factors. 
Those skilled in the art will also recognize that appropriate dosage levels can be 
obtained based on results with known oral vaccines such as, for example, a vaccine 

1 5 based on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an 

enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et ah, J. Urol 
150:917-921 (1993); Boedecker et al., American Gastroenterological Assoc. 999:A-222 
(1993)). The number of doses will depend upon the disease, the formulation, and 
efficacy data from clinical trials. Without intending any limitation as to the course of 

20 treatment, the treatment can be administered over 3 to 8 doses for a primary 

immunization schedule over 1 month (Boedeker, American Gastroenterological Assoc, 
888:A-222(1993)). 

It will be apparent to those skilled in the art that some of the vaccine 
compositions of the invention are usefuls only for preventing K pylori infection, some 

25 are useful only for treating H. pylori infection, and some are useful for both preventing 
and treating K pylori infection. In a preferred embodiment, the vaccine composition of 
the invention provides protection against H. pylori infection by stimulating humoral 
and/or cell-mediated immunity against H. pylori. It should be understood that 
amelioration of any of the symptoms of K pylori infection is a desirable clinical goal, 

30 including a lessening of the dosage of medication used to treat H. pylori-caused disease. 

VII. Antibodies Reactive With H. pylori Polypeptides 

The invention also includes antibodies specifically reactive with the subject H. 
pylori polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be 
35 made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by 
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a 
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hamster or rabbit can be immunized with an immunogenic form of the peptide. 
Techniques for conferring immunogenicity on a protein or peptide include conjugation 
to carriers or other techniques well known in the art. An immunogenic portion of the 
subject K pylori polypeptide can be administered in the presence of adjuvant. The 
5 progress of immunization can be monitored by detection of antibody titers in plasma or 
serum. Standard ELISA or other immunoassays can be used with the immunogen as 
antigen to assess the levels of antibodies. 

In a preferred embodiment, the subject antibodies are tmmunospecific for 
antigenic determinants of the K pylori polypeptides of the invention, e.g. antigenic 
10 determinants of a polypeptide shown in the Sequence Listing, or a closely related human 
or non-human mammalian homolog (e.g., 90% homologous, more preferably at least 
95% homologous). In yet a further preferred embodiment of the invention, the anti-//. 
pylori antibodies do not substantially cross react (i.e., react specifically) with a protein 
which is for example, less than 80% percent homologous to a sequence shown in the 
1 5 Sequence Listing. By "not substantially cross react", it is meant that the antibody has a 
binding affinity for a non-homologous protein which is less than 10 percent, more 
preferably less than 5 percent, and even more preferably less than 1 percent, of the 
binding affinity for a protein contained in the Sequence Listing. In a most preferred 
embodiment, there is no crossreactivity between bacterial and mammalian antigens. 
20 The term antibody as used herein is intended to include fragments thereof which 

are also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented 
using conventional techniques and the fragments screened for utility in the same manner 
as described above for whole antibodies. For example, F(ab')2 fragments can be 
generated by treating antibody with pepsin. The resulting F(ab')2 fragment can be 
25 treated to reduce disulfide bridges to produce Fab 1 fragments. The antibody of the 
invention is further intended to include bispecific and chimeric molecules having an 
anti-//. pylori portion. 

Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori 
polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab* and 
30 F(ab )2, can be used to block the action of H. pylori polypeptide and allow the study of 
the role of a particular H. pylori polypeptide of the invention in aberrant or unwanted 
intracellular signaling, as well as the normal cellular function of the H. pylori and by 
microinjection of anti-//. pylori polypeptide antibodies of the present invention. 
Antibodies which specifically bind H. pylori epitopes can also be used in 
35 immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of//, pylori antigens. Anti H. pylori polypeptide antibodies can be 
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used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate 
H. pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, 
the ability to monitor H. pylori polypeptide levels in an individual can allow 
determination of the efficacy of a given treatment regimen for an individual afflicted 
5 with such a disorder. The level of an H. pylori polypeptide can be measured in cells 
found in bodily fluid, such as in urine samples or can be measured in tissue, such as 
produced by gastric biopsy. Diagnostic assays using anti-//. pylori antibodies can 
include, for example, immunoassays designed to aid in early diagnosis of K pylori 
infections. The present invention can also be used as a method of detecting antibodies 
1 0 contained in samples from individuals infected by this bacterium using specific K pylori 
antigens. 

Another application of anti-//. pylori polypeptide antibodies of the invention is 
in the immunological screening of cDNA libraries constructed in expression vectors 
such as >.gtl 1 , A,gtl 8-23, XZAP, and A.ORF8. Messenger libraries of this type, having 

1 5 coding sequences inserted in the correct reading frame and orientation, can produce 

fusion proteins. For instance, Xgtl 1 will produce fusion proteins whose amino termini 
consist of fi-galactosidase amino acid sequences and whose carboxy termini consist of a 
foreign polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be 
detected with antibodies, as, for example, reacting nitrocellulose filters lifted from 

20 infected plates with anti-//. pylori polypeptide antibodies. Phage, scored by this assay, 
can then be isolated from the infected plate. Thus, the presence of H. pylori gene 
homologs can be detected and cloned from other species, and alternate isoforms 
(including splicing variants) can be detected and cloned. 

25 VIII. Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention 

The nucleic acid, polypeptides and antibodies of the invention can be combined 
with other reagents and articles to form kits. Kits for diagnostic purposes typically 
comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels. 
Kits typically comprise other reagents for performing hybridization reactions, 

30 polymerase chain reactions (PGR), or for reconstitution of lyophilized components, such 
as aqueous media, salts, buffers, and the like. Kits may also comprise reagents for 
sample processing such as detergents, chaotropic salts and the like. Kits may also 
comprise immobilization means such as particles, supports, wells, dipsticks and the like. 
Kits may also comprise labeling means such as dyes, developing reagents, radioisotopes, 

35 fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating 

agents and the like. With the nucleic acid and amino acid sequence information provided 



BNS page 77 



WO 97/37044 



PCTYUS97/05223 



-76- 

herein, individuals skilled in art can readily assemble kits to serve their particular 
purpose. Kits further can include instructions for use. 

IX. Drug Screening Assays Using H. pylori Polypeptides 
5 By making available purified and recombinant H. pylori polypeptides, the 

present invention provides assays which can be used to screen for drugs which are either 
agonists or antagonists of the normal cellular function, in this case, of the subject K 
pylori polypeptides, or of their role in intracellular signaling. Such inhibitors or 
potentiators may be useful as new therapeutic agents to combat H. pylori infections in 

10 humans. A variety of assay formats will suffice and, in light of the present inventions, 
will be comprehended by the skilled artisan. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 

1 5 systems, such as may be derived with purified or semi-purified proteins, are often 

preferred as "primary" screens in that they can be generated to permit rapid development 
and relatively easy detection of an alteration in a molecular target which is mediated by 
a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead being 

20 focused primarily on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with other proteins or change in enzymatic properties of 
the molecular target. Accordingly, in an exemplary screening assay of the present 
invention, the compound of interest is contacted with an isolated and purified H. pylori 
polypeptide. 

25 Screening assays can be constructed in vitro with a purified H. pylori 

polypeptide or fragment thereof, such as an H, pylori polypeptide having enzymatic 
activity, such that the activity of the polypeptide produces a detectable reaction product. 
The efficacy of the compound can be assessed by generating dose response curves from 
data obtained using various concentrations of the test compound. Moreover, a control 

30 assay can also be performed to provide a baseline for comparison. Suitable products 
include those with distinctive absorption, fluorescence, or chemi-luminescence 
properties, for example, because detection may be easily automated. A variety of 
synthetic or naturally occurring compounds can be tested in the assay to identify those 
which inhibit or potentiate the activity of the //. pylori polypeptide. Some of these 

35 active compounds may directly, or with chemical alterations to promote membrane 
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permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic 
activity) in whole, live H. pylori cells. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references and published patent 
5 applications cited throughout this application are hereby incorporated by reference. 

Other Embodiments 

Many of the nucleic acids and corresponding polypeptides of the invention were 

disclosed previously in the parent applications, U.S. S.N. 08/761,3 18, filed December 6, 
10 1996 (Attorney Docket No.: GTN-009CP2), U.S.S.N. 08/736,905, filed October 25, 

1996 (Attorney Docket No.: GTN-010CP) and U.S.S.N. 08/738,859, filed October 28, 

1996 (Attorney Docket No.: GTN-009CP), which are a continuation-in-part of U.S.S.N. 

08/625,81 1, filed March 29, 1996 (Attorney Docket No.: GTN-009), and U.S.S.N. 

08/758,731, filed April 2, 1996 (Attorney Docket No.: GTN-010). The correlation 
15 between sequence identification numbers in the above-identified parent applications and 

sequence identification numbers provided herein is outlined in Table 7 below. 
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EXEMPLIFICATION 

5 I. Cloning and Sequencing of H. pylori DNA 

H. pylori chromosomal DNA was isolated according to a basic DNA protocol 
outlined in Schleif R.F. and Wensink P.C., Practical Methods in Molecular Biology^ 
p.98, Springer- Verlag, NY., 1981, with minor modifications. Briefly, cells were 
pelleted, resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6) and GES lysis buffer 

10 (5.1 M guanidium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-Iaurylsarcosine) was 
added. Suspension was chilled and ammonium acetate (NH 4 Ac) was added to final 
concentration of 2.0 M. DNA was extracted, first with chloroform, then with phenol- 
chloroform, and reextracted with chloroform, DNA was precipitated with isopropanol, 
washed twice with 70% EtOH, dried and resuspended in TE. 

1 5 Following isolation whole genomic H. pylori DNA was nebulized (Bodenteich et 

al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994) 
to a median size of 2000 bp. After nebulization, the DNA was concentrated and 
separated on a standard 1% agarose gel. Several fractions, corresponding to 
approximate sizes 900-1300 bp, 1300-1700 bp, 1700-2200 bp, 2200-2700 bp, were 

20 excised from the gel and purified by the GeneClean procedure (Biol 01, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. 
The healed DNA was then ligated to unique BstXI-linker adopters in 1 00-1000 fold 
molar excess. These linkers are complimentary to the BstXI-cut pMPX vectors., while 
the overhang is not self-complimentary. Therefore, the linkers will not concatemerize 

25 nor will the cut-vector religate itself easily. The linker-adopted inserts were separated 

from the unincorporated linkers on a 1% agarose gel and purified using GeneClean. The 
linker-adopted inserts were then ligated to each of the 20 pMPX vectors to construct a 
series of "shotgun" subclone libraries. The vectors contain an out-of-frame lacZ gene at 
the cloning site which becomes in-frame in the event that an adapter-dimer is cloned, 

30 allowing these to be avoided by their blue-color. 

All subsequent steps were based on the multiplex DNA sequencing protocols 
outlined in Church G.M. and Kieffer-Higgins S., Science 240:185-188, 1988. Only 
major modifications to the protocols are highlighted. Briefly, each of the 20 vectors was 
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then transformed into DH5a competent cells (Gibco/BRL, DH5a transformation 
protocol). The libraries were assessed by plating onto antibiotic plates containing 
ampicillin, methicillin and IPTG/Xgal. The plates were incubated overnight at 37°C. 
Successful transformants were then used for plating of clones and pooling into the 
5 multiplex pools. The clones were picked and pooled into 40 ml growth medium 
cultures. The cultures were grown overnight at 37°C. DNA was purified using the 
Qiagen Midi-prep kits and Tip-100 columns (Qiagen, Inc.). In this manner, 100 tig of 
DNA was obtained per pool. 1 5 96- well plates of DNA were generated to obtain a 5-1 0 
fold sequence redundancy with 250-300 base average read-lengths. 

1 0 These purified DNA samples were then sequenced using the multiplex DNA 

sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins 
S., Science 240: 1 85-188, 1 988) or by Sequithrem (Epicenter Technologies) dideoxy 
sequencing protocols. The sequencing reactions were electrophoresed and transferred 
onto nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. 

15 and Church G.M., Methods in Enzemology 218:187-222, 1993) or by electroblotting 

(Church, supra), 24 samples were run per gel. 45 successful membranes were produced 
by chemical sequencing and 8 were produced by dideoxy sequencing. The DNA was 
covalently bound to the membranes by exposure to ultraviolet light, and hybridized with 
labeled oligonucleotides complimentary to tag sequences on the vectors (Church, supra). 

20 The membranes were washed to rinse off non-specifically bound probe, and exposed to 
X-ray film to visualize individual sequence ladders. After autoradiography, the 
hybridized probe was removed by incubation at 65° C, and the hybridization cycle 
repeated with another tag sequence until the membrane has been probed 38 times for 
chemical sequencing membranes and 10 times for the dideoxy sequencing membranes. 

25 Thus, each gel produced a large number of films, each containing new sequencing 

information. Whenever a new blot was processed, it was initially probed for an internal 
standard sequence added to each of the pools. 

Digital images of the films were generated using a laser-scanning densitometer 
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on 

30 computer workstations (VaxStation 4000's) using the program REPLICA™ (Church et 
al., Automated DNA Sequenicng and Analysis (J.C. Venter, ed.), Academic Press, 1994). 
Image processing included lane straightening, contrast adjustment to smooth out 
intensity differences, and resolution enhancement by iterative gaussian decon volution. 
The sequences were then automatically picked in REPLICA™ and displayed for 

35 interactive proofreading before being stored in a project database. The proofreading was 
accomplished by a quick visual scan of the film image followed by mouse clicks on the 
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bands of the displayed image to modify the base calls. For typical sequences derived by 
chemical sequencing, the error rate of the REPLICA™ base calling software was 2-5% 
with most errors occurring near the end of a sequence read. Many of the sequence errors 
could be detected and corrected because multiple sequence reads covering the same 
5 portion of the genomic DNA provide adequate sequence redundancy for editing. Each 
sequence automatically received a number correspond to (microtiter plate and probe 
information) and lane set number (corresponding to microtiter plate columns). This 
number serves as a permanent identifier of the sequence so it is always possible to 
identify the original of any particular sequence without recourse to a specialized 
1 0 database. 

Routine assembly of H. pylori sequences was done using the program FALCON 
(Church, Church et al., Automated DNA Sequenicng and Analysis (J.C. Venter, ed.), 
Academic Press, 1994). This program has proven to be fast and reliable for most 
sequences. The assembled contigs were displayed using a modified version of 

1 5 GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al., 

Nucleic Acid Res. 12:387-95, 1984) that interacts with REPLICA™. This provided for 
an integrated editor that allows multiple sequence gel images to be instantaneously 
called up from the REPLICA™ database and displayed to allow rapid scanning of 
contigs and proofreading of gel traces where discrepancies occurred between different 

20 sequence reads in the assembly. 

II. Identification, Cloning and Expression of H. pylori Nucleic Acids 

Expression and purification of the H. pylori polypeptides of the invention can be 
performed essentially as outlined below. 

25 To facilitate the cloning, expression and purification of membrane and secreted 

proteins from K pylori, a gene expression system, such as the pET System (Novagen), 
for cloning and expression of recombinant proteins in E. coli, is selected. Also, a DNA 
sequence encoding a peptide tag, the His-Tag, is fused to the 3' end of DNA sequences 
of interest in order to facilitate purification of the recombinant protein products. The 3' 

30 end is selected for fusion in order to avoid alteration of any 5' terminal signal sequence. 
The exception to the above is ppiB, a gene cloned for use as a control in the expression 
studies. The sequence for H. pylori ppiB contains a DNA sequence encoding a His-Tag 
fused to the 5' end of the full length gene, because the protein product of this gene does 
not contain a signal sequence and is expressed as a cytosolic protein. 
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PCR Amplification and Cloning of Nucleic Acids Containing ORF's for 
Membrane and Secreted Polypeptides from H. pylori 
Nucleic acids chosen (for example, from the nucleic acids set forth in the 
Sequence Listing) for cloning from the J99 strain of H. pylori are prepared for 

5 amplification cloning by polymerase chain reaction (PCR). Synthetic oligonucleotide 
primers specific for the 5' and 3' ends of open reading frames (ORFs) are designed and 
purchased from GibcoBRL Life Technologies (Gaithersburg, MD, USA). All forward 
primers (specific for the 5' end of the sequence) are designed to include an Ncol cloning 
site at the extreme 5' terminus. These primers are designed to permit initiation of 

1 0 protein translation at a methionine residue followed by a valine residue and the coding 
sequence for the remainder of the native H. pylori DNA sequence. All reverse primers 
(specific for the 3' end of any K pylori ORF) include a EcoRI site at the extreme 5' 
terminus to permit cloning of each K pylori sequence into the reading frame of the pET- 
28b. The pET-28b vector provides sequence encoding an additional 20 carboxy- 

1 5 terminal amino acids including six histidine residues (at the extreme C-terminus), which 
comprise the His-Tag. An exception to the above, as noted earlier, is the vector 
construction for the ppiB gene. A synthetic oligonucleotide primer specific for the 5' 
end of ppiB gene encodes a BamHI site at its extreme 5' terminus and the primer for the 
3' end of the ppiB gene encodes a Xhol site at its extreme 5' terminus. 

20 Genomic DNA prepared from the J99 strain of H. pylori is used as the source of 

template DNA for PCR amplification reactions (Current Protocols in Molecular 
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a DNA 
sequence containing an //. pylori ORF, genomic DNA (50 nanograms) is introduced into 
a reaction vial containing 2 mM MgCl2> 1 micromolar synthetic oligonucleotide primers 

25 (forward and reverse primers) complementary to and flanking a defined H. pylori ORF, 
0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, dCTP, dTTP and 2.5 units 
of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, 
NJ, USA) in a final volume of 100 microliters. 

Upon completion of thermal cycling reactions, each sample of amplified DNA is 

30 washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, 

Gaithersburg, MD, USA). All amplified DNA samples are subjected to digestion with 
the restriction endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, 
MA, USA)(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. 
Ausubel et al., eds.„ 1994). DNA samples are then subjected to electrophoresis on 1 .0 % 

35 NuSeive (FMC BioProducts, Rockland, ME USA) agarose gels. DNA is visualized by 
exposure to ethidium bromide and long wave uv irradiation. DNA contained in slices 
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isolated from the agarose gel is purified using the Bio 101 GeneClean Kit protocol (Bio 
101 Vista, CA,USA) 

Cloning of H. pylori Nucleic Acids Into an Expression Vector 
The pET-28b vector is prepared for cloning by digestion with endonucleases, 
5 e.g., Ncol and EcoRI (Current Protocols in Molecular Biology, John Wiley and Sons, 
Inc., F. Ausubel et al., eds., 1994). In the case of cloning ppiB, the pET-28a vector, 
which encodes a His-Tag that can be fused to the 5 1 end of an inserted gene, is used and 
the cloning site prepared for cloning with the ppiB gene by digestion with BamHI and 
Xhol restriction endonucleases. 

10 Following digestion, DNA inserts are cloned (Current Protocols in Molecular 

Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously 
digested pET-28b expression vector, except for the amplified insert for ppiB, which is 
cloned into the pET-28a expression vector. Products of the ligation reaction are then 
used to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, 

15 John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below. 

Transformation Of Competent Bacteria With Recombinant Plasmids 
Competent bacteria, E coli strain BL21 or E. coli strain BL21(DE3), are 
transformed with recombinant pET expression plasmids carrying the cloned H. pylori 
sequences according to standard methods (Current Protocols in Molecular, John Wiley 

20 and Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction is 
mixed with 50 microliters of electrocompetent cells and subjected to a high voltage 
pulse, after which, samples are incubated in 0.45 milliliters SOC medium (0.5% yeast 
extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM MgC12, 10 mM MgS04 and 
20, mM glucose) at 37°C with shaking for I hour. Samples are then spread on LB agar 

25 plates containing 25 microgram/ml kanamycin sulfate for growth overnight. 

Transformed colonies of BL21 are then picked and analyzed to evaluate cloned inserts 
as described below. 

Identification Of Recombinant Expression Vectors With H. Pylori Nucleic Acids 
Individual BL21 clones transformed with recombinant pET-28b-H.pylori ORFs 
30 are analyzed by PCR amplification of the cloned inserts using the same forward and 

reverse primers, specific for each H. pylori sequence, that were used in the original PCR 
amplification cloning reactions. Successful amplification verifies the integration of the 
K pylori sequences in the expression vector (Current Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). 
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Isolation and Preparation of Nucleic Acids From Transformants 
Individual clones of recombinant pET-28b vectors carrying properly cloned H. 
pylori ORFs are picked and incubated in 5 mis of LB broth plus 25 microgram/ml 
kanamycin sulfate overnight. The following day plasmid DNA is isolated and purified 
5 using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA). 
Expression Of Recombinant H. Pylori Sequences In E. coli 
The pET vector can be propagated in any E. coli K-12 strain e.g. HMS1 74, 
HB101, JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 
10 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative 
that carries the lad gene, the lacUVS promoter and the gene for T7 RNA polymerase. 
T7 RNA polymerase is induced by addition of isopropyl-B-D-thiogalactoside (IPTG), 
and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying 
its gene of interest. Strains used include: BL21(DE3) (Studier, F.W., Rosenberg, A.H., 
15 Dunn, J. J., and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89). 

To express recombinant H. pylori sequences, 50 nanograms of plasmid DNA 
isolated as described above is used to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression system kit). The 
lacZ gene (beta-galactosidase) is expressed in the pET-System as described for the H. 
20 pylori recombinant constructions. Transformed cells are cultured in SOC medium for 1 
hour, and the culture is then plated on LB plates containing 25 micrograms/ml 
kanamycin sulfate. The following day, bacterial colonies are pooled and grown in LB 
medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 
nM of 0.5 to 1.0 O.D. units, at which point, 1 millimolar IPTG was added to the culture 
25 for 3 hours to induce gene expression of the H. pylori recombinant DNA constructions . 
After induction of gene expression with IPTG, bacteria are pelleted by 
centrifiigation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets 
are resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 
mM EDTA (STE buffer). Cells are then centrifiiged at 2000 x g for 20 min at 4°C. Wet 
30 pellets are weighed and frozen at -80°C until ready for protein purification. 

HI. Purification Of Recombinant Proteins From E. Coli 
Analytical Methods 

The concentrations of purified protein preparations are quantified 
35 spectrophotometrically using absorbance coefficients calculated from amino acid 

content (Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations are 
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also measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, 
and Lowry, O.H., Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, 
pages 265-275, using bovine serum albumin as a standard. 

SDS-polyacrylamide gels (12% or 4.0 to 25 % acrylamide gradient gels) are 
5 purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue. 

Molecular weight markers include rabbit skeletal muscle myosin (200 kDa), £. coli (- 
galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum 
albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (3 1 kDa), soybean 
trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 
10 kDa). 

1. Purification of soluble proteins 

All steps are carried out at 4°C. Frozen ceils are thawed, resuspended in 5 
volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10% 
glycerol, 0.1 % -mercaptoethanol, 200 (g/ ml lysozyme, 1 mM phenylmethylsulfonyl 
15 fluoride (PMSF), and 10 ug/ml each of leupeptin, aprotinin, pepstatin, L-l-chloro-3-[4- 
tosy lamido]-7-amino-2-heptanone (TLCK), L- 1 -chloro-3-[4-tosylamido]-4-pheny 1-2- 
butanone (TPCK), and soybean trypsin inhibitor, and ruptured by several passages 
through a small volume microfluidizer (Model M-l 10S, Microfluidics International 
Corporation, Newton, MA). The resultant homogenate is made 0.1 % Brij 35, and 
20 centrifuged at 100,000 x g for 1 hour to yield a clear supernatant (crude extract). 

Following filtration through a 0.8 (m Supor filter (Gelman Sciences, FRG) the 
crude extract is loaded directly onto a Ni 2+ " nitrolotriacetate-agarose (NTA) with a 5 
milliliter bed volume (Hochuli, E., Dbeli, H., and Schacheer, A. (1987) J. 
Chromatography 41 1, 177-184) pre-equilibrated in lysis buffer containing 10 % 
25 glycerol, 0.1 % Brij 35 and 1 mM PMSF. The column is washed with 250 ml (50 bed 
volumes) of lysis buffer containing 10 % glycerol, 0.1 % Brij 35, and are eluted with 
sequential steps of lysis buffer containing 10 % glycerol, 0.05 % Brij 35, 1 mM PMSF, 
and: either 20, 100, 200, or 500 mM imidazole. Fractions are monitored by absorbance 
at OD2go nm, and peak fractions are analyzed by SDS-PAGE. 
30 2. Purification of insoluble proteins from inclusion bodies 

The following steps are carried out at 4°C. Cell pellets are resuspended in lysis 
buffer with 10% glycerol 200 (g/ ml lysozyme, 5 mM EDTA, ImM PMSF and 0.1 % - 
mercaptoethanol. After passage through the cell disrupter, the resulting homogenate is 
made 0.2 % deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g, for 30 min. 
35 The pellets are washed with lysis buffer containing 10 % glycerol, 10 mM EDTA, 1% 
Triton X-100, 1 mM PMSF and 0.1% -mercaptoethanol, followed by several washes 
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with lysis buffer containing 1 M urea, 1 mM PMSF and 0.1 % -mercaptoethanol. The 
resulting white pellet is composed primarily of inclusion bodies, free of unbroken cells 
and membranous materials. 

Dialysis and concentration of protein samples 
5 Urea is removed slowly from the protein samples by dialysis against Tris- 

buffered saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5 % 
deoxycholate (DOC) with sequential reduction in urea as follows; 6M, 4M, 3M, 2M, 
1M, 0.5 M and finally TBS without any urea. Each dialysis step is conducted for a 
minimum of 4 hours at room temperature. 
1 0 After dialysis, samples are concentrated by pressure filtration using Amicon 

stirred-cells. Protein concentrations are measured using the methods of Perkins (1986 
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and 
Lowry ((1951) J. Biol. Chem. 193, pages 265-275). 

15 IV. Assessment Of The Antigenicity Of Outer Membrane Localized Antigens Of H. 
pylori 

Purification of outer membranes form H. pylori can be performed by essentially 
foilwing the protocol outlined below. 

K pylori strains J99 (ATCC# 55679) and Ah244 are grown on chocolate blood 

20 agar containing 5% (vol/vol) horse blood, at 37(C in an atmosphere containing 10% 

CO2 for 48 h. Bacteria were harvested by suspension in 20 mM Tris, pH 7.5. The cells 
are collected by centrifugation at 12,000 Xg, for 20 min at 4(C and washed 3 times with 
20 mM Tris, pH 7.5. Cells are suspended in 20 mM Tris, pH 7.5 and broken by 
sonication on ice (eight bursts of 30 s at 60 watts with 60 s pauses between bursts). 

25 DNase (0.1 mg) and RNase (0.5 mg) are added to the cell suspension, and the mixture is 
incubated for 30 minutes at room temperature. The cell suspension is centrifuged at 
12,000 Xg for 20 min, at 4(C. The supernatant was retained and centrifuged again. Total 
membranes are collected from the supernatant by centrifugation at 40,000 Xg for 30 
minute, at 4°C. The pellet are washed twice in 20 mM Tris, pH 7.5. The protein 

30 content is assayed using the Bradford protein assay, with bovine serum albumin (BSA) 
as a standard. The suspension is then adjusted to 1 mg protein /ml. The solubilization 
of the membranes is realized by adding N-lauryl-sarcosine to this suspension in a ratio 
of 6 mg of N-lauryl-sarcosine per mg of protein. The suspension is incubated for 30 
minutes at room temperature in presence of N-lauryl-sarcosine. Outer membranes are 

35 collected by centrifugation at 40,000 Xg for 30 minutes at 4°C. The pellet is washed 3 
times with Milli Q quality water, aliquoted and stored at -20°C until use. 
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Identijication Of Outer Membrane Antigens of K pylori 
Outer membrane antigens can be identified using a protocol outlined below. 
Proteins are separated on sodium dodecyl sulfate polyacrylamide gels (SDS- 
PAGE) according to the method described by Laemmlu U.K. (1970) Nature (London) 
5 Volume 227, 680-685. Samples are prepared by suspension in standard treatment buffer 
and heated at 100°C for 10 min. Approximately 1-5 mg of protein is loaded per well on 
8X10 cm minigels (0.75 mm). The separated proteins are then transferred to PVDF 
membranes as described below. 

Electroblotting of separated proteins to PVDF membranes is performed in a Bio 
10 Rad Mini-Trans Blot Electrophoretic Transfer cell. The PVDF membrane Immobilon- 
pSQ is employed. Electroblotting is carried out for 60 min at 50V using CAPS transfer 
buffer (lOmM 3-[Cyclohexylamino]-l-propanesulfonic acid, 10% methanol). The 
membrane is stained with 0.2% Ponceau S and destained with Milli Q quality water. 
Antigens within the preparation are then identified using western 
1 5 immunoblotting. After electroblotting, non specific binding sites of the PVDF 

membrane are blocked with 5% non fat dry milk in 10 mM Tris-HCl-0.9% NaCl, pH 
7.5. The membrane is incubated with a appropriate dilution of normal mouse serum in 
10 mM Tris-HCl-0.9% NaCl-0.5% Tween 20-0.5% BSA, pH 7.5, for 2 h at 37(C and 
then washed three times with 10 mM Tris-HCl-0.9% NaCl-0.5% Tween 20, pH 7.5 
20 (TTBS). Alkaline phosphatase conjugated anti-mouse Ig, from goat is then added in 10 
mM Tris-HCl-0.9% NaCl-0.5% Tween 20-0.5% BSA, pH 7.5 and incubated for lh at 
room temperature. After this incubation, the membrane is washed three times in TTBS. 
The reactive bands are revealed using 5-bromo-4-chloro-3-indoIyl phosphate (Bio-Rad) 
as the Alkaline phosphatase substrate and Nitro Blue Tetrazolium (Bio-Rad) as the color 
25 development reagent. 

For amino acid microsequencing, proteins that are identified as immunoreactive 
are cut from a fresh unreacted immobilon membranes and microsequenced at the 
Worcester Foundation microsequencing facility. Membranes from which the protein 
bands are cut are then subjected to western immunoblot as described above to confirm 
30 that the appropriate band had been excised. 

V. Analysis Of H. Pylori Proteins As Vaccine Candidates 

To investigate the immunomodulatory effect of H. pylori proteins, a mouse///. 
pylori model was used. This model mimics the human H. pylori infection in many 
35 respects. The focus is on the effect of oral immunization in H. pylori infected animals 
in order to test the concept of therapeutic oral immunotherapy. 
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Animals 

Female SPF BALB/c mice were purchased from Bomholt Breeding center 
(Denmark). They were kept in ordinary makrolon cages with free supply of water and 
food. The animals were 4-6 weeks old at arrival. 
5 Infection 

After a minimum of one week of acclimatization, the animals were infected with 
a type 2 strain (VacA negative) of//, pylori (strain 244, originally isolated from an ulcer 
patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse 
stomach. The bacteria were grown overnight in Brucella broth supplemented with 10 % 
10 fetal calf serum, at 37°C in a microaerophilic atmosphere (10% C0 2 , 5%O z ). The 
animals were given an oral dose of omeprazole (400 jimol/kg) and 3-5 h after this an 
oral inoculation of //. pylori in broth (approximately 10 8 cfu/animal). Positive take of 
the infection was checked in some animals 2-3 weeks after the inoculation. 

Antigens 

1 5 Recombinant H. pylori antigens were chosen based on their association with 

externally exposed H. pylori cell membrane. These antigens were selected from the 
following groups: (1.) Outer Membrane Proteins; (2.) Periplastic/Secreted proteins; (3.) 
Outer Surface proteins; and (4.) Inner Membrane proteins. All recombinant proteins 
were constructed with a hexa-HIS tag for purification reasons and the non-Helicobacter 

20 pylori control protein (ji-galactosidase from E. coli; LacZ), was constructed in the same 
way. 

All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer 
or in a buffer containing 0.5% Deoxycholate (DOC). 
The antigens are listed in Table 8 below. 

25 

Table 8 

Helicobacter pylori proteins 

Outer membrane Proteins 

30 Protein 1 
Protein 2 
Protein 3 
Protein 4 
Protein 5 

35 

Periplastic/Secreted proteins 
Protein 6 

Other cell envelope pr teins 

40 Protein 7 
Protein 8 
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Flagella-associated pr teins 

Protein 9 

Control proteins 

5 p-galactosidase (LacZ) 

Immunizations 

Ten animals in each group were immunized 4 times over a 34 day period (day 1 , 
15, 25 and 35). Purified antigens in solution or suspension were given at a dose of 100 

10 ng/mouse. As an adjuvant, the animals were also given 10 |ig/mouse of Cholera toxin 
(CT) with each immunization. Omeprazole (400 fimol/kg) was given orally to the 
animals 3-5 h prior to immunization as a way of protecting the antigens from acid 
degradation. Infected control animals received HEPES buffer + CT or DOC buffer + 
CT. Animals were sacrificed 2-4 weeks after final immunization. A general outline of 

1 5 the study is shown in Table 9 below. 

Table 9 

Study outline, therapeutic immunization: 



20 Mice were all infected with H. pylori strain Ah244 at day 30. 







Mouse strain 




Dates for 




Substance 


nflO 


Dose/mouse 


dosing 


25 


I . Controls, PBS 


Balb/c 


0.3 ml 


0, 14, 24, 34 




2. Cholera toxin, 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 




3. Protein I, 100 jig + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24,34 




4. Protein 5, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14,24, 34 




5. Protein 10, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 


30 


6. Protein 9, 1 00 ug + CT 1 0 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 




7. Protein 2, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 




8. Protein 6, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14,24, 34 




9. Protein 4, 1 00 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 




10. Protein 7, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 


35 


1 1 . Protein 8, 1 00 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14,24, 34 




12. Protein 3, 100 ug + CT 10 ug 


Balb/c 


0.3 ml 


0, 14, 24, 34 



Analysis of infection 

Mucosal infection: The mice were sacrificed by CO2 and cervical dislocation. 
40 The abdomen was opened and the stomach removed. After cutting the stomach along 

the greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of 
an area of 25mm 2 was scraped separately with a surgical scalpel. The mucosa scraping 
was suspended in Brucella broth and plated onto Blood Skirrow selective plates. The 
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plates were incubated under microaerophilic conditions for 3-5 days and the number of 
colonies was counted. The identity of H pylori was ascertained by urease and catalase 
test and by direct microscopy or Gram staining. 

The urease test was performed essentially as follows. The reagent, Urea Agar 
5 Base Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog # 
0284-6 1 -3). Urea agar base concentrate was diluted 1:10 with water. 1 ml of if the 
diluted concentrate was mixed with 100-200 ^il of actively growing H pylori cells. 
Color change to magenta indicated that cells were urease positive. 

The catalase test was performed essentially as follows. The reagent, N,N,N\N f - 
1 0 Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog # 
T3 134). A solution of the regent (1% w/v in water) was prepared. H. pylori cells were 
swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to 
dark blue indicated that the cells were catalase positive. 

Serum antibodies: From all mice serum was prepared from blood drawn by heart 
1 5 puncture. Serum antibodies were identified by regular ELISA techniques, where the 
specific antigens of Helicobacter pylori were plated. 

Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm 
of duodenum were performed in 50% of the mice in order to detect the presence of 
antibodies in the mucous. The antibody titers were determined by regular ELISA 
20 technique as for serum antibodies. 

Statistical analysis: Wilcoxon-Mann- Whitney sign rank test was used for 
determination of significant effects of the antigens on Helicobacter pylori colonization. 
PO.05 was considered significant. Because the antrum is the major colonization site for 
Helicobacter most emphasis was put upon changes in the antral colonization. 

25 

Results 

Antibodies in sera : All antigens tested given together with CT gave rise to a 
measurable specific titer in serum. The highest responses were seen with Proteins 3, 4, 
9, 1, and 7 (see Figure 1). 
30 Antibodies in mucus : In the mucus scrapings, specific antibodies against all 

antigens tested were seen. By far the strongest response was seen with Protein 6, 
followed by 1,3, and 9 (see Figure 2). 

Therapeutic immunization effects: 

All control animals (BALB/c mice) were well colonized with H. pylori (strain 
35 AH244) in both antrum and corpus of the stomach. Of the antigens tested 3 proteins 
(Proteins 4, 7, and 1) gave a good and significant reduction and/or eradication of the H 
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pylori infection. The degree of colonization of the antrum was lower following 
immunization with Proteins 8, 9, and 3 compared to control. The effect of Proteins 5, 2, 
and 6 did not differ from control. The control protein lacZ, i.e. the non-// pylori 
protein, had no eradication effect and in fact had higher Helicobacter colonization 
5 compared to the HEPES + CT control. All data are shown in Figures 3 and 4 for 

proteins dissolved in HEPES and DOC respectively. Data is shown as geometric mean 
values. n=8-10 Wilcoxon-Mann- Whitney sign rank test * = p<0.05; x/10 = number of 
mice showing eradication of//, pylori over the total number of mice examined. 

The data presented indicate that all of the H. pylori associated proteins included 

10 in this study, when used as oral immunogens in conjunction with the oral adjuvant CT, 
resulted in stimulation of an immune response as measured by specific serum and 
mucosal antibodies. A majority of the proteins led to a reduction, and in some cases 
complete clearance of the colonization of K pylori in this animal model. It should be 
noted that the reduction or clearance was due to heterologous protection rather than 

1 5 homologous protection (the polypeptides were based on the K pylori J99 strain 

sequence and used in the therapeutic immunization studies against a different (AH244) 
challenge strain), indicating the vaccine potential against a wide variety of H pylori 
strains. 

The highest colonization in the antrum was seen in animals treated with the non- 
20 Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori 
antigens were specific. 

Taken together these data strongly support the use of these H pylori proteins in a 
pharmaceutical formulation for the use in humans to treat and/or prevent H. pylori 
infections. 

25 

VI. Sequence Variance Analysis of genes in Helicobacter pylori strains 

Four genes were cloned and sequenced from several strains of//, pylori to 
compare the DN A and deduced amino acid sequences. This information was used to 
determine the sequence variation between the H. pylori strain, J99, and other //. pylori 
30 strains isolated from human patients. 

Preparation of Chromosomal DNA. 

Cultures of H. pylori strains (as listed in Table 12) were grown in BLBB (1% 
Tryptone, 1% Peptamin 0.1% Glucose, 0.2% Yeast Extract 0.5% Sodium Chloride, 5% 
Fetal Bovine Serum) to an OD 6 oo of 0.2. Cells were centrifuged in a Sorvall RC-3B at 
35 3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 mis of 10 mM Tris- 
HC1, 0. 1 mM EDTA (TE). Lysozyme was added to a final concentration of lmg/ml 
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along with, SDS to 1% and RNAse A + Tl to 0.5mg/ml and 5 units/ml respectively, and 
incubated at 37°C for one hour. Proteinase K was then added to a final concentration of 
0.4mg/ml and the sample was incubated at 55 C for more than one hour. NaCl was 
added to the sample to a concentration of 0.65 M, mixed carefully, and 0.15 ml of 10% 
5 CTAB in 0.7M NaCL (final is 1% CTAB/70mM NaCL) was added followed by 
incubation at 65°C for 20 minutes. At this point, the samples were extracted with 
chloroformrisoamyl alcohol, extracted with phenol, and extracted again with 
chloroformrisoamyl alcohol. DNA was precipitated with either EtOH (1.5 x volumes) or 
isopropanol (0.6 x volumes) at -70°C for 1 Ominutes, washed in 70% EtOH and 
1 0 resuspended in TE. 

PCR Amplification and cloning. 

Genomic DNA prepared from twelve strains of Helicobacter pylori was used as 
the source of template DNA for PCR amplification reactions (Current Protocols in 

15 Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To 
amplify a DNA sequence containing an K pylori ORF, genomic DNA ( 1 0 nanograms) 
was introduced into a reaction vial containing 2 mM MgCl 2 , 1 micromolar synthetic 
oligonucleotide primers (forward and reverse primers, see Table 10) complementary to 
and flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate; 

20 dATP, dGTP, dCTP, dTTP and 0.5 units of heat stable DNA polymerase (Amplitaq, 
Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 20 
microliters in duplicate reactions. 

Table 10 

25 Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences. 



Outer membrane 
Proteins 


Forward primer 5' to 3' 


Reverse Primer 5' to V 


Protein 1 1 (for strains 
AH4, AH15, AH61, 
5294, 5640, AH 18, and 
AH244) 


5-TTAACCATGGTGAAAAGC 
GATA-3' (SEQ ID NO: 1091) 


5'-TAGAATTCGCCTCTAAAA 
CTTTAG-3' (SEQ ID NO: 1092) 


Protein 11 (for strains 
AH5, 5155, 7958, 
AH24.and J99) 


5-TTAACCATGGTGAAAAGC 
GATA-3' (SEQ ID NO: 1093) 


5'-TAGAATTCGCATAACGAT 
CAATC-3 1 (SEQ ID NO: 1 094) 


Protein 12 


5-ATATCCATGGTGAGTTTG 
ATGA-3* (SEQ ID NO: 1095) 


5 -ATGAATTCAATTTTTTATT 
TTGCCA-3' (SEQ ID NO: 1096) 


Protein 13 


5'-AATTCCATGGCTATCCAA 
ATCCG-3' (SEQ ID NO: 1097) 


SVATGAATTCGCCAAAATC 
GTAGTATT-3- (SEQ ID NO: 1098) 


Protein 14 


5-GATACCATGGAATTTATGA 
AAAAG-3* (SEQ ID NO: 1099) 


5-TGAATTCGAAAAAGTGT 
AGTTATAC-3* (SEQ ID NO: 11 00) 
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The following thermal cycling conditions were used to obtain amplified DNA 
products for each ORF using a Perkin Elmer Cetus/ GeneAmp PCR System 9600 
thermal cycler: 

5 Sequences for Proeins 12 and 14; 
Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min 
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for L5 min 
Reactions were concluded at 72°C for 6 minutes. 

10 

Sequence for Protem 1 1 for strains AH5, 5155, 7958, AH24,and J99; 
Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 1 5 sec, 30°C for 1 5 sec and 72°C for 1 .5 min 
25 cycles at 94°C for 1 5 sec, 5 q 5°C for 15 sec and 72°C for 1.5 min 
1 5 Reaction was concluded at 72°C for 6 minutes. 

Sequences for Protein 1 1 and Protein 13 for strains AH4, AH 15, AH61, 5294, 5640, 
AH 18, and Hp244 ; 
Denaturation at 94°C for 2 min, 
20 2 cycles at 94°C for 1 5 sec, 30°C for 20 sec and 72°C for 2 min 
25 cycles at 94°C for 15 sec, 55°C for 20 sec and 72°C for 2 min 
Reactions were concluded at 72°C for 8 minutes. 



Upon completion of thermal cycling reactions, each pair of samples were 
25 combined and used directly for cloning into the pCR cloning vector as described below. 
Cloning of H. pylori DNA sequences into the pCR TA cloning vector. 

All amplified inserts were cloned into the pCR 2. 1 vector by the method 
described in the Original T A cloning kit (Invitrogen, San Diego, C A). Products of the 
ligation reaction were then used to transform the TOPI OF' (INVaF' in the case of H. 
30 pylori sequence 350) strain of E. coli as described below. 

Transformation of competent bacteria with recombinant plasm ids 
Competent bacteria, E coli strain TOPI OF" or £. coli strain INVaF' were 
transformed with recombinant pCR expression plasmids carrying the cloned H. pylori 
sequences according to standard methods (Current Protocols in Molecular Biology, John 
35 Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5 
micromolar BME was added to each vial of 50 microliters of competent cells. 
Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and 
incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a 
"heat shock" at 42°C for 30 seconds, and were subsequently placed on ice for an 
40 additional 2 minutes, after which, samples were incubated in 0.45 milliliters SOC 
medium (0.5% yeast extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM 
MgC12, 10 mM MgS04 and 20, mM glucose) at 37°C with shaking for 1 hour. Samples 
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were then spread on LB agar plates containing 25 microgram/ml kanamycin sulfate or 
100 rnicrograrns/ml ampicillan for growth overnight. Transformed colonies of TOPI 0F ? 
or INVaF were then picked and analyzed to evaluate cloned inserts as described below. 
Identification of recombinant PCR plasmids carrying H. pylori sequences 
5 Individual TOPI OF' or INVaF* clones transformed with recombinant pCR- 

H.pylori ORFs were analyzed by PCR amplification of the cloned inserts using the same 
forward and reverse primers, specific for each H. pylori sequence, that were used in the 
original PCR amplification cloning reactions. Successful amplification verified the 
integration of the H. pylori sequences in the cloning vector (Current Protocols in 

10 Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). 

Individual clones of recombinant pCR vectors carrying properly cloned H. pylori 
ORFs were picked for sequence analysis. Sequence analysis was performed on ABI 
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as 
found in PCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific 

15 to the ORF as listed in Table 1 1 below. 



Table 1 1 

Oligonucleotide primers used for sequencing of H. pylori DNA sequences. 



Outer 

membrane 

Proteins 


Forward primers 5* to 3' 


Reverse Primers 5* to 3* 


Protein 1 1 


S'-CCCTTCATTTTAGAAATCGO' 
(SEQ ID NO: 1 101) 
5'-ATTTCAACCAATTCAATGCG-3' 
(SEQ ID NO: 11 02) 
S'-GCCCCTTTTG A TTTG A A GCT- 3 * 
(SEQ ID NO: 1103) 
5-TCGCTCCAAGATACCAAG 
AAGT-3* (SEQ ID NO: 1 1 04) 
S'-CTTGAATTAGGGGCAAAG 
ATCG-3' (SEQ ID NO: 1 105) 
5-ATGCGTTTTTACCCAAAGA 
AGT-3* (SEQ ID NO: 1 1 06) 
5'- AT A ACGCC ACTTCCTTATTG 
GT-3' (SEQ ID NO: 1107) 


5 f -CTTTGGGTAAAAACGCATC-3* 
(SEQ ID NO: 1108) 
5 t -CGATCTTTGATCCTAATTCA-3' 
(SEQ ID NO: 1109) 
5*-ATCAAGTTGCCTATGCTGA-3' 
(SEQ ID NO: 11 10) 


Protein 12 


5-TTGAACACTTTTGATTATG 
CGG-3* (SEQ ID NO:llll) 
S'-GGATTATGCGATTG 1T1TA 
CAAG-3 1 (SEQ ID NO: 1112) 


S'-GTCTTTAGCAAAAATGGCGTC- 
3 f (SEQ ID NO:lll3) 

S'-AATGAGCGTAAGAGAGCCTTC- 
3* (SEQ ID NO: 11 14) 


Protein 13 


5-CTTATGGGGGTATTGTCA-3' 
(SEQ ID NO: 11 15) 
5 f -AGCATGTGGGTATCCAGC-3* 
(SEQIDNO:1116) 


S'-AGGTTGTTGCCTAAAGACTo 
(SEQ ID NO: 11 17) 
5*-CTGCCTCCACCTTTGAT(^3 , 
(SEQIDNO:1118) 
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Protein 14 


5 -ACCAA rATCAATT GGCACT-3 
(SEQ ID NO: 111 9) 
S'-ACTTGGAAAAGCTCTGCAO* 
(SEQ ID NO: 1120) 


5 -CTTGCTTGTCATATCTAGC-3 
(SEQ lDNO:1121) 
S'-GTTGAAGTGTTGGTGCTA-S* 
(SEQ ID NO: 1122) 




5'-CAAGCAAGTGG7 TTGG TI T 1 
AG-3* (SEQ ID NO: 11 23) 
5-TGGAAAGAGCAAATCATTG 
AAG-3' (SEQ ID NO: 1 124) 


5'-GCCCATAATCAAAAAGCCCAT- 
3* (SEQ ID NO: 1125) 
5'-CTAAAACCAAACCACTTGC 
TTGTC-3' (SEQ ID NO: 1 126) 


Vector 
Primers 


5 , -GTAAAACGACGGCCAG-3 t 
(SEQ ID NO: 1127) 


5-CAGGAAACAGCTATGAC-3' 
(SEQ ID NO: 1128) 



Results 

To establish the PCR error rate in these experiments, five individual clones of 
Protein 1 1 , prepared from five separate PCR reaction mixtures from K pylori strain J99, 
5 were sequenced over a total length of 897 nucleotides for a cumulative total of 4485 
bases of DNA sequence. DNA sequence for the five clones was compared to the DNA 
sequence of Protein 1 1 obtained previously by a different method, i.e., random shotgun 
cloning and sequencing. The PCR error rate for the experiments described herein was 
determined to be 2 base changes out of 4485 bases, which is equivalent to an estimated 

1 0 error rate of less than or equal to 0.04%. 

DNA sequence analysis was performed on four different open reading frames 
identified as genes and amplified by PCR methods from a dozen different strains of the 
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four 
open reading frames that were selected for this study showed statistically significant 

1 5 BLAST homology to defined proteins present in other bacterial species. Those ORFs 
included: Protein 1 1 , homologous to the val A & B genes encoding an ABC transporter 
in F. novicida; Protein 12, homologous to lipoprotein e (P4) present in the outer 
membrane of H. influenzae; Protein 13, homologous to fee A, an outer membrane 
receptor in iron (III) dicitrate transport in E. coli. Protein 14 was identified as an 

20 unknown open reading frame, because it showed low homology with sequences in the 
public databases. 

To assess the extent of conservation or variance in the ORFs across various 
strains of H. pylorL changes in DNA sequence and the deduced protein sequence were 
compared to the DNA and deduced protein sequences found in the J99 strain of//. 

25 pylori (see Table 12 below). Results are presented as percent identity to the J99 strain of 
H. pylori sequenced by random shotgun cloning. To control for any variations in the 
J99 sequence each of the four open reading frames were cloned and sequenced again 
from the J99 bacterial strain and that sequence information was compared to the 
sequence information that had been collected from inserts cloned by random shotgun 

30 sequencing of the J99 strain. The data demonstrate that there is variation in the DNA 
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sequence ranging from as little as 0.12 % difference (Protein 14 ? J99 strain) to 
approximately 7% change (Protein 1 1, strain AH5). The deduced protein sequences 
show either no variation (Protein 14, strains AH 1 8 and AH24) or up to as much as 
7.66% amino acid changes (Protein 1 1, Strain AH5). 

5 

TABLE 12 



Multiple Strain DNA Sequence analysis of H. pylori Vaccine Candidates 



J99 Protein U: 


11 


11 


12 


12 


13 


13 


14 


14 


Length of Region 


248 a.a. 


746 nt. 


232 a.a. 


696 nt. 


182 a.a. 


548 nt. 


273 a.a. 


819 nt. 


Sequenced: 


















Strain Tested 




















AA 


Nuc. 


AA 


Nuc. 


AA 


Nuc. 


AA 


Nuc. 




identity 


identity 


identity 


identity 


identity 


identity 


identity 


identity 


J99 


100.00% 


100.00% 


100.00% 


100.00% 


100.00% 


100.00% 


99.63% 


99.88% 


AH244 


95.16% 


95.04% 


n.d. 


n.d. 


99.09% 


96.71% 


98.90% 


96.45% 


AH4 


95.97% 


95.98% 


97.84% 


95.83% 


n.d. 


n.d. 


97.80% 


95.73% 


AH5 


92.34% 


93.03% 


98.28% 


96.12% 


98.91% 


96.90% 


98.53% 


95.73% 


AH15 


95.16% 


94.91% 


97.41% 


95.98% 


99.82% 


97.99% 


99.63% 


96.09% 


AH6I 


n.d. 


n.d. 


97.84% 


95.98% 


99.27% 


97.44% 


n.d. 


n.d. 


5155 


n.d. 


n.d. 


n.d. 


n.d. 


99.45% 


97.08% 


98.53% 


95.60% 


5294 


94.35% 


94.37% 


98.28% 


95.40% 


99.64% 


97.26% 


97.07% 


95.48% 


7958 


94.35% 


94.10% 


97.84% 


95.40% 


n.d. 


n.d. 


99.63% 


96.46% 


5640 


95.16% 


94.37% 


97.41% 


95.69% 


99.09% 


97.63% 


98.53% 


95.48% 


AHI8 


n.d. 


n.d. 


98.71% 


95.69% 


99.64% 


97.44% 


100.00% 


95.97% 


AH24 


94.75% 


95.04% 


97.84% 


95.40% 


99.27% 


96.71% 


100.00% 


96.46% 


n.d. - not done 



















VII. Experimental Knock-Out Protocol for the Determination of Essential H. pylori 
Genes as Potential Therapeutic Targets 

1 5 Therapeutic targets were chosen from genes whose protein products appear to 

play key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis, 

transcription, translation, regulation and colonization/virulence. 

The protocol for the deletion of portions of H. pylori genes/ORFs and the 

insertional mutagenesis of a kanamycin-resistance cassette was modified from 
20 previously published methods (Labigne-Roussel et al., 1988, J. Bacteriology 170, pp. 

1704-1 708; Cover et al.,1994, J. Biological Chemistry 269, pp. 10566-10573; Reyrat et 

al., 1995, Proc. Natl. Acad. Sci. 92, pp 8768-8772). 

Identification and Cloning of H. pylori Gene Sequences 
25 The sequences of the genes or ORFs (open reading frames) selected as knock-out 

targets were identified from the //. pylori genomic sequence and used to design primers 
to specifically amplify the genes/ORFs. All synthetic oligonucleotide primers (Table 
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13) were designed with the aid of the OLIGO program (National Biosciences, Inc., 
Plymouth, MN 55447, USA), and were purchased from Gibco/BRL Life Technologies 
(Gaithersburg, MD, USA). Specific primers (Fl and Rl) were chosen which flanked 
most or all of the ORF, depending on its size. If the ORF was smaller than 800 to 1000 
5 base pairs, flanking primers were chosen outside of the open reading frame. 
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p 



Q> 

a 



o 
o 
c 

CO 8 

< 2 



C/3 





o£ 


AACACTTCAATTT 
CCTCTATA 
(SEQ ID NO: II 34) 


TAGGGTGTCTAG 
GGATTTGAT 
(SEQ ID NO: 1140) 


GCTAAAACTTCAT 
CGCTCAAT 
(SEQ ID NO: 1146) 


TGGTCTATCATGC 
GAATTAT 
(SEQ ID NO: II 52) 


TTTCACCGGCAAT 
TTTAGGC 
(SEQ ID NO: II 58) 


AATCAAAAACAA 
GAGCGTGG 
(SEQ ID NO: 1164) 


AACGGGTTTGAC 
ACTGATGA 
(SEQ ID NO: 1170) 


Targeting Primers 


m 
U. 


AATCCGTTTCGCT 
AATTTAGT 
(SEQ ID NO: 1133) 


ATGTCCGTTGTCT 
GTATGGAA 
(SEQ ID NO: 1139) 


GTTAGGAAATTAG 
AAATCATTG 
(SEQ ID NO: 1145) 


GGGGCATTGTGTT 
TGTIHT 
(SEQ1DNO:! 151) 


GGCGTTATTAAGC 
GACATCG 
(SEQ ID NO: II 57) 


AAATCTCTGTGGG 

CTTAGTG 

(SEQ ID NO: 1163) 


TGCCCAAAAATCC 
ACTAACG 
(SEQ ID NO: 1169) 




<N 
OC 


CGGGTCAAAACGA 
CCACTTAA 
(SEQ ID NO: 1132) 


CCTTTATTGG ITTT 
GATCGTG 
(SEQ ID NO: 1138) 


GTGGCTAAAAATG 
AGGGCTT 
(SEQ ID NO: 1144) 


CGTGGTGGTTTTC 
CCGTTAG 
(SEQ ID NO: II 50) 


CTGTTT ITAGCGTC 
CCTGTA 

(SEQ ID NO: II 56) 


GGAATTTTAACGC 

TCTTT'I 1 

(SEQ ID NO: 1162) 


CGCTAATCACATC 

CTTTCTT 

(SEQ ID NO: 1168) 


Deletion-creating 
Primers 


r-j 
U- 


TCTTGCATCTTAAT 

CCACTCC 

(SEQIDN0:II3I) 


ATAGAGAGCGTTG 
TGTTTAGC 
(SEQ ID NO: 11 37) 


GCAAATCCCCAGC 
CACTTCC 
(SEQ ID NO: 1143) 


CATTGATGCCTAA 
AACTTCG 
(SEQ ID NO: 1149) 


GCCCTGATCCATT 
CCCCCCT 
(SEQ ID NO: 1155) 


GCGTGTTTTCTAA 
GGGTTCA 
(SEQ ID NO: 1161) 


TCACGCTTTCTAA 
ATCATCA 
(SEQ ID NO: 1167) 






AGAGCGTATTTCA 
CCCGAAAG 
(SEQ ID NO: 11 30) 


TTGACTAAACACA 
TGCGAGAA 
(SEQ ID NO: 1136) 


GAAATGGAAAATA 
GCGGTCAA 
(SEQ ID NO: 11 42) 


CAAACAAACCTGA 
CAAGAAAC 
(SEQ ID NO: 1148) 


CGCGCTAGAGGCT 
TGTAAAA 
(SEQ ID NO: 1154) 


GTAAAAACACCGC 
TAACGCAT 
(SEQ ID NO: 1160) 


GGAGGGCGCAA TT 
AAACATCG 
(SEQ ID NO: 1166) 


Cloning Primers 




TTGCCCCATCGTA 
TTGATAGA 
(SEQ ID NO: 1129) 


TGGTATAAGGATT 
TGAATGGA 
(SEQ ID NO: 1135) 


GCGTTTGGCTTCTT 

CGTTGTC 

(SEQ ID NO: 1141) 


GTTGGGCAGAAAA 
TAAGGTGA 
(SEQ ID NO: 1147) 


GCGTTTGGGGATT 
TGATGTTC 
(SEQ ID NO: 1153) 


GCGTTTTGATTCT 
GTCTGTTA 
(SEQ ID NO: 1159) 


GCCCCAGCCCCAT 
AATACAAA 
(SEQ ID NO: 1165) 




Access- 
ion 

Number 


P23329 


P29820 


P34828 


PI 4900 


P22I88 


P07874 


PI9358 




Gene 
Name 


rnh 


ppiB 




MurD 


UJ 

Urn 

3 

5 


AlgA 


metL 
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GGAGCAATCAGC 
CATTTTTC 
(SEQ ID NO: 1176) 


CAAAAGAGGCGG 
GTTCATGC 
(SEQ ID NO: 1182) 


AATGGTTTTCCTA 
TACCCTTGA 
(SEQ ID NO: 1188) 


AATGGTTTTCCTA 
TACCCTTGA 
(SEQ ID NO; 1194) 


CCATTACATrrCG 
CCTC 

(SEQ ID NO: 1200) 


TTTTGGGGTAAAA 
AGGCTGAA 
(SEQ ID NO: 1206) 


AAAGGATACAAG 
GGGGA 

(SEQ ID NO: 12 12) 


TTTCAAAACGCTC 
ACCTTTTG 
(SEQ ID NO: 12 18) 


CTTCAGCGATACT 
AAAAGAT 
(SEQ ID NO: 1224) 


o 

3 if 

o < - 
f~ < o 

< < z 
^ G o 

£E© 

< < 52 


AAG TITATGCGGG 
CGAGATT 
(SEQ ID NO: 1175) 


ACGGGTTAGCAGG 

GCAGAAT 

(SEQIDN0:II81) 


CAAAAGATTGAAG 
CAGAAGAGT 
(SEQ ID NO: II 87) 


CAAAAGATTGAAG 
CAGAAGAGT 
(SEQ ID NO: 1193) 


ATAGGCATGCAGA 
ATTTTTCC 
(SEQ ID NO: II 99) 


TAAGTCAAGCCAT 
AAAACCAAA 
(SEQ ID NO: 1205) 


GCCCCATTAAAAT 

CCTTTTCT 

(SEQIDNO:121l) 


TTTGATAAACGCC 

CACTTTTT 

(SEQIDNO:12I7) 


AACAACCGCTAAA 
ATCAAAC 
(SEQ ID NO: 1223) 


AGGCTATTCAAGG 
fGGCTAAA 
(SEQ ID NO: 1229) 


GGATAGCCTGCCA 
AAACGCC 
(SEQ ID NO. 11 74) 


GCATTGATCGCAT 
TTTTAGCC 
(SEQ ID NO: II 80) 


CCCCTAATAGTCT 
GTCAATCAT 
(SEQ ID NO: 1186) 


CCCCTAATAGTCT 
GTCAATCAT 
(SEQ ID NO: 1192) 


TACGAGACAAAAT 
AGGGATTT 
(SEQ ID NO: 1 198) 


TTCTAAAAGGTGG 

TAATCTTC 

(SEQ ID NO: 1204) j 


GTAAGGCGTTAGA 
AAAATACC 
(SEQ ID NO: 12 10) 


CTTTTGGGGGAGT 
TTGACAAG 
(SEQ ID NO: 12 16) 


AATTAGCCCTTAA 
AATAGATG 
(SEQ ID NO: 1222) 


GGCTGGTAAATAC 
TGGATAG 
(SEQ ID NO: 1228) 


GGGCGATGTGATT 
GGCGATT 
(SEQ ID NO: 11 73) 


AGCGGGCAGTTTA 
GGACCAC 
(SEQ ID NO: 11 79) 


AGTGTGGTCGCCT 
GTGGTGGAG 
(SEQ ID NO: 1185) 


u ^ 
u — 

u o - 

P < d 

a a ~ 

f- (- O 
O H £ 

< o S2- 


GCTAGGATTTATG 

CCAATTTA 

(SEQIDNO.II97) 


GTTTTAAAAACGC 
CATAGCCA 
(SEQ ID NO: 1 203) 


CAGCCACACTTCA 
ATGTCTAT 
(SEQ ID NO: 1209) 


TGTTTGGAAATGC 

TGGTGATC 

(SEQIDNO:1215) 


ACAATAATAGGCT 
TTGTCTTC 
(SEQ ID NO. 1221) 


TTTTTGGGGGTAT 
GCTAAAA 
(SEQ ID NO: 1227) 


GCGTTTTTAAGAC 
TGAATACA 
(SEQ ID NO: 1172) 


CGGCCTCCITCAA 
ACACATT 
(SEQ ID NO: 1178) 


CATACACGCTCAC 
TTCATCG 
(SEQ ID NO: 1184) 


CATACACGCTCAC 
TTCATCG 
(SEQ ID NO: 1190) 


TCAACTGATAGGT 
AATATCCC 
(SEQ ID NO: 1 196) 


GGGCTTGTATTCA 
TTTTGTAA 
(SEQ ID NO: 1202) 


AGACAGCACCAGT 
1TGATAAA 
(SEQ ID NO: 1208) 


1 ! 1 1 TTAGGGAGG 
ATTGAGAT 
(SEQ ID NO: 12 14) 


A TAATGAGTTTGA 
TCGTTACG 
(SEQ ID NO: 1220) 


GCTGGATAAGGAT 

TTGCTCT 

(SEQ ID NO: 1226) 


GAA TGCGGTGG TT 

TTAGAGAG 

(SEQIDN0:I171) 


CTAGCGATTCAAG 1 

GCGATGG 

(SEQ ID NO: 1 177) 


TITAGAAGTCGTT 
GATGAGA 
(SEQ ID NO: 1183) 


GAGAGCAAATCCT 
TATCCAG 
(SEQ ID NO: 1189) 


TTGAAACCCCAAA 
AGTTTTAC 
(SEQ ID NO: II 95) 


CGATAGATATTGT 

AGAAGTCA 

(SEQIDNO:120I) 


ATCTTTTTGCCCTT 

GCTCATA 

(SEQ ID NO: 1207) 


CTCGCTCCATTTTA 

TCTTTTA 

(SEQIDN0:I2I3) 


< 

i_ — 
?- r o 

u h 7L 

P H 9 


TAGGGGCGATTGA 
AAACAGC 
(SEQ ID NO: 1225) 


XI 6278 


U09549 


VO 


M3769I 


U32794 


Ml 0040 


X050I7 


P00960 


1.14580 


M3769I 


< 
m 

,Z3 


ID 

u. 


set A 


FliM- 
selB 


MurC 


dnaE 


SerS 


an 


Gltx 


Sig28 
(IliA) 
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AAAACAAACGCA 

TCAAAAAT 

(SEQIDN0:I236) 


< 

a 3 

t- a Q 
§Sa 

O H 

P < 52 


TCCCTATCATAGC 
GTTAGTGC 
(SEQ ID NO: 1248) 


CGCAAAAGGGTA 
GGGGATAA 
(SEQ ID NO: 1254) 


ATTCAAAGAAAG 
CTGGCTGTCT 
(SEQ ID NO: 1260) 


AAAGATACTCCC 
CTGTGATTA 
(SEQ ID NO: 1266) 


CACTTGAGCGTTA 
GCAACAAT 
(SEQ ID NO: 1272) 


GCCTTTTTAGACA 
ACCCTACT 
(SEQ ID NO: 1278) 


CCCTAAATCCAA 
ATCAAGCAG 
(SEQ ID NO: 1284) 


CTTTTCCTTTCGCT 

TGAAGA 

(SEQIDNO:1235) 


TTTTTAAGGGCGT 
ATTTTTGT 
(SEQ ID NO: 1241) 


AGCGTTI TTGTAA 
GGGGGTAT 
(SEQ ID NO: 1247) 


CTTAGCCCCTTTA 

GTGTTTA 

(SEQIDNO:I253) 


GCGGTCATGGAAT 

TTTTAGA 

(SEQ ID NO: 1259) 


AAAAAGAAAGAA 
GAACTCGTG 
(SEQ ID NO: 1265) 


AAAGGTTTGAAAC 
AAGAAATCT 
(SEQ ID NO: 1271) 


GGAATACAATAAC 
GCATAAAT 
(SEQ ID NO: 1277) 


TATGCGGCTCATC 

CTATTAAA 

(SEQIDN0:I283) 


AAAATCGGTCTGA 
TGCTCTTA 
(SEQ ID NO: 1234) 


TGCAACAAAAATA 

CGCCCTT 

(SEQ ID NO: 1240) 


CGCTTTAACGCTC 
CTTTCAC 
(SEQ ID NO. 1246) 


GCTCATGGATATA 
AAGGGGTATT 
(SEQ ID NO: 1252) 


TAATTCTACAGAG 
TGGTTAATGG 
(SEQ ID NO: 1258) 


GTAAGTTTGAGCG 
GCTAATTC 
(SEQ ID NO: 1264) 


CGCCCCCAAAAAG 
TCGCAGTA 
(SEQ ID NO: 1270) 


GGGATTGTTACAG 
GAAAAGAT 
(SEQ ID NO: 1276) 


TTATTGTGGAGTT 
GCTTGTCA 
(SEQ ID NO: 1282) 


TGATTGTTGGGTA 

GCTCTCA 

(SEQIDNO:1233) 


CGCCCGAATGGAT 
GAGTAGG 
(SEQ ID NO: 1239) 


GCCCCATTCCTGTT 
TTTAGC 

(SEQ ID NO: 1245) 


CCAAAGCTAAAGC 

GGTGTTTT 

(SEQIDN0:I25I) 


AAAGATAACGCTA 
GGATTTCTAC 
(SEQ ID NO: 1257) 


ACCATGCTCATTA 

ACGCTAGG 

(SEQIDN0:I263) 


GCAGCGGCACAGC 
GACTTTAG 
(SEQ ID NO: 1269) 


CAAGGGCGTTTTT 
TGGGGTAT 
(SEQ ID NO: 1275) 


GCGATTATGGGGT 
ATTTATTG 
(SEQ ID NO: 128 1) 


1 » 

O m 
< 2 

o < o 

U < 2 

o a a 


GCACAAAGACCCC 
ACCACGAT 
(SEQ ID NO: 1238) 


GGCCCATTCTTTC 
GGATATT 
(SEQ ID NO: 1244) 


GCACAATTCCCAC 
ACGCTGC 
(SEQ ID NO: 1250) 


CAAACTTATCGCC 

CTCTCTA 

(SEQ ID NO: 1256) 


GAACCCCCTAAAA 
TGACAAT 
(SEQ ID NO: 1262) 


GCATGGTATTCAG 

CGTTTTC 

(SEQ ID NO: 1268) 


CGCTAGGAGAAAG 
GAAGGAAA 
(SEQ ID NO: 1274) 


GCTAGAAATGCCA 
TGAGAAAG 
(SEQ ID NO: 1280) 


GCAGTTGGCGGTA 

TTTGGTG 

(SEQIDN0:I23!) 


TTTCAAGGCGAGG 
AGGCAGAT 
(SEQ ID NO: 1237) 


CGCGCTCAAAATC 
CCTAAAT 
(SEQ ID NO: 1243) 


CTTAGGGGTTTTT 
AGCATGAA 
(SEQ ID NO: 1249) 


TTTTATTTTTAGAA 
ACGAATC 
(SEQ ID NO: 1255) 


GCTTGTGGGGGTT 
GTTTTAT 
(SEQ ID NO: 1261) 


CAAAGAAACGCAA 
AATACAG 
(SEQ ID NO: 1267) 


GGGGTGTTTGAAA 
TTGATAGA 
(SEQ ID NO: 1273) 


CCCCTAAACTCAA 
ATCTCAAT 
(SEQ ID NO: 1279) 


M73443 


U 12405 


D26185 


D 10602 


U32794 


M86305 


U09549 






.rP 

C/3 


Murl 


dnaB 


MurG 


IpxC 


< 
"5 


IpxB 


K024 


KO 26 
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Genomic DN A prepared from the Helicobacter pylori HpJ99 strain (ATCC 55679 ) was 
used as the source of template DN A for amplification of the ORFs by PCR (polymerase 
chain reaction) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. 
Ausubel et al., editors, 1994). For the preparation of genomic DNA from //. pylori, see 
5 Example I. PCR amplification was carried out by introducing 10 nanograms of genomic 
HpJ99 DNA into a reaction vial containing 10 mM Tris pH 8.3, 50 mM FCCI, 2 mM 
MgCI 2 , 2 microMolar synthetic oligonucleotide primers (forward=Fl and reverse=Rl), 
0.2 mM of each deoxynucleotide triphosphate (dATP,dGTP, dCTP, dTTP), and 1 .25 
units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., 
1 0 Branchburg, NJ, USA) in a final volume of 40 microliters. The PCR was carried out 
with Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cyclers. The thermal 
cycling conditions used to obtain amplified DNA products for each knock-out target are 
shown in Table 14. 



15 TABLE 14 

PCR Conditions 

MurC 

Denaturation at 94°C for 2 min., 

25 cycles of 94°C for 15 sec, 48°C for 15 sec, 72°C for 1 min, 30 sec, 
20 Final Extension of 72°C for 20 minutes. 

Sjg54 

Denaturation at 94 C for 2 min., 

25 cycles of 94°C for 15 sec, 50°C for 15 sec, 72°C for 1 min, 30 sec, 
25 Final Extension of 72°C for 20 minutes, 
IpxC 

Denaturation at 94°C for 2 min., 

32 cycles of 94°C fox \5 sec, 50°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

30 

KQ24.KQ26.KQ27 
Denaturation at 94 C for 2 min., 

25 cycles of 94°C forJS sec, 50.5°C for 20 sec, 72°C for 2 min, 
Final Extension of 72°C for 20 minutes. 

35 

KQ29.26kDa Protein 
Denaturation at 94 C for 2 min., 

28 cycles of 94°C for 15 sec, 50.5°C for 20 sec, 72°C for 2 min, 
Final Extension of 72°C for 20 minutes. 

40 

DnaE,Glvcvl 

Denaturation at 94°C for 2 min., 

20 cycles of 94°C for 15 sec, 51°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

45 
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Gltx 

Denaturation at 94°C for 2 min.. 

25 cycles of 94°C for 1 5 sec, 51°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

5 

KQ28 

Denaturation at 94°C for 2 min., 

25 cycles of 94°C for 15 sec, 51°C for 15 sec, 72°C for 2min, 
Final Extension of 72°C for 20 minutes. 

10 

KO30 

Denaturation at 94°C for 2 min., 

25 cycles of 94°C forJS sec, 51.5°C for 15 sec, 72°C for lmin, 45 sec. 
Final Extension of 72°C for 20 minutes. 

15 

MurL MurG 

Denaturation at 94°C for 2 min., 

25 cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

20 

DnaB. KdtA.LpxB 
Denaturation at 94 C for 2 min., 

27 cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

25 

Tsf JlgE. FliM.Sig28.MurB 
Denaturation at 94 C for 2 min., 

30 cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

30 

PpiB 

Denaturation at 94°C for 2 min., 

30 cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 2 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

35 

MurD.MurE. AlgA.MetL.FusA.SerS.Rnh 
Denaturation at 94 C for 2 min., 

30 cycles of 94°C for 15 sec, 55°C for 15 sec, 72°C for 1 min, 30 sec, 
Final Extension of 72°C for 20 minutes. 

40 

Upon completion of thermal cycling reactions, each sample of amplified DNA 
was visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current 
Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 
1 994) to determine that a single product of the expected size had resulted from the 
45 reaction. Amplified DNA was then washed and purified using the Qiaquick Spin PCR 
purification kit (Qiagen, Gaithersburg, MD, USA). 

PCR products were cloned into the pT7Blue T-Vector (catalog#69820- 1 , 
Novagen, Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The 
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ligation of the PCR product into the vector was accomplished by mixing a 6 fold molar 
excess of the PCR product, 10 ng of pT7Blue-T vector (Novagen), I microliter of T4 
DNA Ligase Buffer (New England Biolabs, Beverly, MA, USA), and 200 units of T4 
DNA Ligase (New England Biolabs) into a final reaction volume of 10 microliters. 
5 Ligation was allowed to proceed for 16 hours at 16°C. 

Ligation products were electroporated (Current Protocols in Molecular Biology, 
John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into electroporation- 
competent XL-1 Blue or DH5-a E.coli cells (Clontech Lab., Inc. Palo Alto, CA, USA). 
Briefly, 1 microliter of ligation reaction was mixed with 40 microliters of 

10 electrocompetent cells and subjected to a high voltage pulse (25 microFarads, 2.5 kV, 
200 ohms) after which the samples were incubated in 0.45 ml SOC medium (0.5% yeast 
extract, 2% tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM MgCl 2 , 10 mM MgS0 4 and 20 
mM glucose) at 37°C with shaking for 1 hour. Samples were then spread onto LB (10 
g/1 bacto tryptone, 5 g/1 bacto yeast extract, 10 g/1 sodium chloride) plates containing 

15 100 microgram/ml of Ampicillin, 0.3% X-gal, and 100 microgram/ml IPTG. These 

plates were incubated overnight at 37°C. Ampicillin-resistant colonies with white color 
were selected, grown in 5 ml of liquid LB containing 100 microgram/ml of Ampicillin, 
and plasmid DNA was isolated using the Qiagen miniprep protocol (Qiagen, 
Gaithersburg, MD, USA). 

20 To verily that the correct Kpylori DNA inserts had been cloned, these pT7Blue 

plasmid DNAs were used as templates for PCR amplification of the cloned inserts, using 
the same forward and reverse primers (Fl and Rl) used for the initial amplification of 
the J99 Kpylori sequence. Recognition of the primers and a PCR product of the correct 
size as visualized on a 2% TAE, ethidium bromide stained agarose gel were 

25 confirmation that the correct inserts had been cloned. Two to six such verified clones 
were obtained for each knock-out target, and frozen at -70°C for storage. To minimize 
errors due to PCR, plasmid DNA from these verified clones were pooled, and used in 
subsequent cloning steps. 

The sequences of the genes/ORFs were again used to design a second pair of 

30 primers (F2 and R2) which flanked the region of//, pylori DNA to be either interrupted 
or deleted (up to 250 basepairs) within the ORFs but were oriented away from each 
other. The pool of circular plasmid DNAs of the previously isolated clones were used as 
templates for this round of PCR. Since the orientation of amplification of this pair of 
deletion primers was away from each other, the portion of the ORF between the primer 

35 would not be included in the resultant PCR product. The PCR product was a linear 
piece of DNA with //. pylori DNA at each end and the pT7Blue vector backbone 
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between them which, in essence, resulted in the deletion of a portion of the ORFs. The 
PCR product was visualized on a 1% TAE, ethidium bromide stained agarose gel to 
confirm that only a single product of the correct size had been amplified. 

A Kanamycin-resistance cassette (Labigne-Roussel et ah, 1988 J. Bacteriology 
5 1 70, 1 704- 1 708) was ligated to this PCR product by the TA cloning method used 
previously (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. 
Ausubel et al., editors, 1994). The Kanamycin cassette containing a Campylobacter 
kanamycin resistance gene was obtained by carrying out an EcoRI digestion of the 
recombinant plasmid pCTB8:&?« (Cover et al.,1994, J. Biological Chemistry 269, pp. 

10 10566-10573). The proper fragment (1.4 kb) was isolated on a 1% TAE gel, and 

isolated using the QIAquick gel extraction kit (Qiagen, Gaithersburg, MD, USA). The 
fragment was end repaired using the Klenow fill-in protocol, which involved mixing 4ug 
of the DNA fragment, 1 microliter of dATP,dGTP, dCTP, dTTP at 0.5 mM, 2 microliter 
of Klenow Buffer (New England Biolabs) and 5 units of Klenow DNA Polymerase I 

1 5 Large (Klenow) Fragment (New England Biolabs) into a 20 microliter reaction, 

incubating at 30°C for 15 min, and inactivating the enzyme by heating to 75°C for 10 
minutes. This blunt-ended Kanamycin cassette was then purified through a Qiaquick 
column (Qiagen, Gaithersburg, MD, USA) to eliminate nucleotides. The OTO overhang 
was then generated by mixing 5 micrograms of the blunt-ended kanamycin cassette, 10 

20 mM Tris pH 8.3, 50 mM KC1, 2 mM MgCl 2 , 5 units of DNA Polymerase (Amplitaq, 
Roche Molecular Systems, Inc., Branchburg, NJ, USA), 20 microliters of 5 mM dTTP, 
in a 100 microliter reaction and incubating the reaction for 2 hours at 37°C. The "Kan- 
T" cassette was purified using a QIAquick column (Qiagen, Gaithersburg, MD, USA). 
The PCR product of the deletion primers (F2 and R2) was ligated to the Kan-T cassette 

25 by mixing 10 to 25 ng of deletion primer PCR product, 50 - 75 ng Kan-T cassette DNA, 
1 microliter lOx T4 DNA Ligase reaction mixture, 0.5 microliter T4 DNA Ligase (New 
England Biolabs, Beverly, MA, USA) in a 10 microliter reaction reaction and incubating 
for 16 hours at 16°C. 

The ligation products were transformed into XL-1 Blue or DH5-a E.coli cells by 

30 electroporation as described previously. After recovery in SOC, cells were plated onto 
LB plates containing 100 microgram/ml Ampicillin and grown overnight at 37°C. These 
plates were then replica plated onto plates containing 25 microgram/ml Kanamycin and 
allowed to grow overnight. Resultant colonies had both the Ampicillin resistance gene 
present in the pT7Blue vector, and the newly introduced Kanamycin resistance gene. 

35 Colonies were picked into LB containing 25 microgram/ml Kanamycin and plasmid 
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DNA was isolated using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, 
USA). 

Several tests by PCR amplification were conducted on these plasmids to verify 
that the Kanamycin was inserted in the K pylori gene/ORF, and to determine the 
5 orientation of the insertion of the Kanamycin-resistance gene relative to the H. pylori 
gene/ORF. To verify that the Kanamycin cassette was inserted into the K pylori 
sequence, the plasmid DNAs were used as templates for PCR amplification with the set 
of primers (Fl and Rl) originally used to clone the H. pylori gene/ORFs. The correct 
PCR product was the size of the deleted gene/ORF but increased in size by the addition 

10 of a 1 .4 kilobase Kanamycin cassette. To avoid potential polar effects of the kanamycin 
resistance cassette on H. pylori gene expression, the orientation of the Kanamycin 
resistance gene with respect to the knock-out gene/ORF was determined and both 
orientations were eventually used in H. pylori transformations (see below). To 
determine the orientation of insertion of the kanamycin resistance gene, primers were 

1 5 designed from the ends of the kanamycin resistance gene ("Kan- 1 " 5'- 

ATCTTACCTATCACCTCAAAT-3' (SEQ ID NO: 1285), and H Kan-2 M 5*- 
AGACAGCAACATCTTTGTGAA-3' (SEQ ID NO: 1286)). By using each of the 
cloning primers (Fl and Rl) in conjunction with each of the Kan primers (4 
combinations of primers), the orientation of the Kanamycin cassette relative to the 

20 H.pylori sequence was determined. Positive clones were classified as either in the "A" 
orientation (the same direction of transcription was present for both the H. pylori gene 
and the Kanamycin resistance gene), or in the "B" orientation (the direction of 
transcription for the H.pylori gene was opposite to that of the Kanamycin resistance 
gene). Clones which shared the same orientation (A or B) were pooled for subsequent 

25 experiments and independently transformed into H. pylori. 

Transformation of Plasmid DNA into H. pylori cells 

Two strains of H. pylori were used for transformation: ATCC 55679 , the clinical 
isolate which provided the DNA from which H. pylori sequence database was obtained, 

30 and AH244, an isolate which had been passaged in, and had the ability to colonize the 
mouse stomach. Cells for transformation were grown at 37°C, 10% C0 2 , 100% 
humidity, either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells were 
grown to exponential phase, and examined microscopically to determine that the cells 
were "healthy" (actively moving cells) and not contaminated. If grown on plates, cells 

35 were harvested by scraping cells from the plate with a sterile loop, suspended in 1 ml of 
Brucella Broth, spun down (1 minute, top speed in eppendorf micro fuge) and 
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resuspended in 200 microliters Brucella Broth. If grown in Brucella Broth liquid, cells 
were centrifuged (15 minutes at 3000 rpm in a Beckman TJ6 centrifuge) and the cell 
pellet resuspended in 200 microliters of Brucella broth. An aliquot of cells was taken to 
determine the optical density at 600 nm, in order to calculate the concentration of cells. 

5 An aliquot (1 to 5 OD 60 o units/25 microliter) of the resuspended cells was placed onto a 
prewarmed Sheep-Blood agar plate, and the plate was further incubated at 37°C, 6% 
C0 2 , 100% humidity for 4 hours. After this incubation, 10 microliters of plasmid DNA 
(100 micrograms per microliter), was spotted onto these cells. A positive control 
(plasmid DNA with the ribonuclease H gene disrupted by kanamycin resistance gene) 

10 and a negative control (no plasmid DNA) were done in parallel. The plates were 
returned to 37°C, 6% C0 2 for an additional 4 hours of incubation. Cells were then 
spread onto that plate using a swab wetted in Brucella broth, and grown for 20 hours at 
37°C, 6% C0 2 . Cells were then transferred to a Sheep-Blood agar plate containing 25 
micrograms/ml Kanamycin, and allowed to grow for 3 to 5 days at 37°C, 6% C0 2 , 

1 5 1 00% humidity. If colonies appeared, they were picked and regrown as patches on a 
fresh Sheep-Blood agar plate containing 25 micrograms/ml Kanamycin. 

Three sets of PCR (three tests) were done to verify that the colonies of 
transformants have arose from homologous recombination at the proper chromosomal 
location. The template for PCR (DNA from the colony) was obtained by a rapid boiling 

20 DNA preparation method. An aliquot of the colony (stab of the colony with a toothpick) 
was introduced into 100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, and 
boiled for 6 minutes. An equal volume of phenol : chloroform (1:1) was added and 
vortexed. The mixture was microfuged for 5 minutes and the supernatant was used as 
DNA template for PCR with combinations of the following primers to verify 

25 homologous recombination at the proper chromosomal location. 

TEST 1 PCR with Fl and Rl primers (cloning primers originally used to 
amplify the gene/ORF). A positive result of homologous recombination at the correct 
chromosomal location should show a single PCR product whose size is expected to be 
the size of the deleted gene/ORF but increased in size by the addition of a 1 .4 kilobase 

30 Kanamycin cassette. A PCR product of just the size of the gene/ORF was proof that the 
gene had not been knocked out and that the transformant was not the result of 
homologous recombination at the correct chromosome location. 

TEST 2 PCR with F3 (primer designed from sequences upstream of the 
gene/ORF), and either primer Kan-1 or Kan-2 (primers designed from the ends of the 

35 kanamycin resistance gene), depending on whether the plasmid DNA used was of "A" 
or "B" orientation. A positive result of homologous recombination at the correct 
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chromosomal location of the sequences of the gene/ORFs upstream from the kanamycin 
resistance gene should show a single PCR product, the expected size to be from the 
location of F3 to the insertion site of kanamycin resistance gene. No PCR product or 
PCR product(s) of incorrect size(s) would prove that the plasmid had not been integrated 
5 at the correct site and that the gene had not been knocked out. 

TEST 3 PCR with R3 (primer designed from sequences downstream of the 
gene/ORF) and either primer Kan-1 or Kan-2, depending on whether the plasmid DNA 
used was of "A" or "B" orientation. A positive result of homologous recombination at 
the correct chromosomal location downstream from the kanamycin resistance gene 
10 would show a single PCR product, the expected size to be from the insertion site of 

kanamycin resistance gene to the downstream location of R3. Again, no PCR product or 
PCR product(s) of incorrect size(s) would prove that the plasmid had not been integrated 
at the correct site and that the gene had not been knocked out. 

Genes that are not essential for survival in vitro normally resulted in many 
1 5 transformants as observed for the positive control of ribonuclease H gene. Any 
transformants showing positive results for all three tests above would result in the 
conclusion that the gene was not essential for survival in vitro. 

Genes that are essential for survival in vitro normally showed very few 
transformants. All transformants would be screened. A negative result of any of the 
20 three above tests for each transformant would lead to the conclusion that the gene had 
not been disrupted, and that the gene was essential for survival in vitro. 

In the event that no colonies resulted from two independent transformations 
while the positive control with the disrupted ribonuclease H plasmid DNA produced 
transformants, the plasmid DNA was further analyzed by PCR on DNA from 
25 transformant populations prior to plating for colony formation, to verify that it can enter 
the cells and undergo homologous recombination at the correct site. Briefly, plasmid 
DNA was incubated according to the transformation protocol described above. DNA 
was extracted from the H. pylori cells immediately after incubation with the plasmid 
DNA and the DNA was used as templates for the above TEST 2 and TEST 3. Positive 
30 results of TEST 2 and TEST 3 would verify that the plasmid DNA could enter the cells 
and undergo homologous recombination at the correct chromosomal location. If TEST 
2 and TEST 3 are positive, then failure to obtain viable transformants indicates that the 
gene is essential and cells suffering a disruption in that gene are incapable of colony 
formation 

35 Genes used in these experiments have been found to be essential, non-essential, 

or are still in progress, as indicated in Table 15. 
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TABLE 15 



Summary of 

knock-out 

genes/ORFs 








Gene 


Accession 
Number 


Pathway 


Status 


rnh 


P23329 


Transcription 


Not essential 


ppiB 


P29820 


Translation 


Not essential 


tsf 


P34828 


Translation 


Essential 


MurD 


PI 4900 


Cell envelope 


Essential 


MurE 


P22188 


Cell envelope 


Essentia] 


AlgA 


P07874 


V iru lence/Colon ization 


Not essential 


metL 


P19358 


Amino acid biosynthesis - aspartate family 


Not essential 


fiisA 


XI 6278 


Translation 


Essential 


FlgE 


U09549 


Virulence/Colonization 


Not essential-motility impaired 


FliM 


M37691 


Virulence/Colonization 


Not essential-motility impaired 


MurC 


U32794 


Cell envelope 


Essential 


dnaE 


Ml 0040 


DNA replication 


Essential 


serS 


X05017 


Translation 


Essential 


g'y 


P00960 


Translation 


Essential 


gitx 


L 14580 


Translation 


Essential 


sig28 (fliA) 


M37691 


Regulatory functions 


Not essential-motility impaired 


sig54 




Regulatory functions 


Not essential-motility impaired 


Murl 


U 12405 


Cell envelope 


Essential 


dnaB 


D26185 


DNA replication 


In progress 


MurG 


D 10602 


Cell envelope 


In progress 


lpxC/envA 


U32794 


Cell envelope 


In progress 


kdtA 


M86305 


Cell envelope 


In progress 


IpxB 


U09549 


Cell envelope 


In progress 


KO 24 




thiolase-like 


In progress 


KO 26 




histone-like 


In progress 


KO 27 




respiratory chain NADH dehydrogenase- 
like 


In progress 
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VIII. Cloning, purification and characterization of the gene encoding the peptidvl- 
propvl cis-trans isomerase of K pylori 

The Helicobacter pylori genome contains an open reading frame (ORF) of 1 70 

amino acids was found to have homology with the Synechococcus sp. (strain PCC 

5 7942).ppi gene (NCBI Accession number P29820). Therefore, to evaluate whether this 

ORF encoded a protein with PPiase activity, the gene was isolated by polymerase chain 

reaction (PCR) amplification cloning, overexpressed in £. coli, and the protein purified 

to homogeneity. To facilitate purification, a polyhistidine tag was added to the N- 

terminus of this ORF. A simple assay using PPiase to evaluate protein folding function 

10 was developed for future use as a high- throughput drug screen. 

Currently, the class of PPIases is divided into three unrelated families: the 

cyclophilins, the FK506-binding (FKBPs) and the parvulins. Although PPiase mutants 

have been reported from yeast and fruit fly, attempts to isolate disruption mutants in 

Escherichia coli were unsuccessful (Shieh, B.H., et.al. (1989) Nature 338:67-70). This 

1 5 suggests that this activity is essential for viability in bacteria. 

Cloning, expression and protein purification 

To facilitate the cloning, expression and purification of ppi from H. pyloric a 
powerful gene expression system, the pET System, for cloning and expression of 
20 recombinant ppi in E. coli. In this study, the sequence for H. pylori ppi contains a DNA 
sequence encoding a His-Tag fused to the 5' end of the full length gene, because the 
protein product of this gene does not contain a signal sequence and is expressed as a 
cytosolic protein. 

A synthetic oligonucleotide primer ( 5 -TT ATGG ATCC AAACC AATT AAA A 
25 CT-3' (SEQ ID NO: 1287)) specific for the 5* end of ppi gene encoded a BamHI site at its 
extreme 5' terminus and a primer (S'-TATCTCGAGTTATAGAGAAGGGCO' (SEQ ID 
NO: 1288)) specific for the 3' end of the ppi gene encoded a Xhol site at its extreme 5' 
terminus. Genomic DNA prepared from the J99 strain of Helicobacter pylori was used 
as the source of template DNA for PCR amplification reactions (Current Protocols in 
30 Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et ah, editors, 1994). To 
amplify a DNA sequence containing the H. pylori ppi gene, genomic DNA (50 
nanograms) was introduced into a reaction vial containing 2 mM MgCl 2 , 1 micromolar 
synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, 
35 dGTP ; dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche 
Molecular Systems, Inc., Branchburg, NJ. USA) in a final volume of 100 microliters. 
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The following thermal cycling conditions were used to obtain amplified DN A products 
for each ORF using a Perkin Elmer Cetus/ GeneAmp PCR System 9600 thermal cycler: 

Conditions for amplification of K pylori ppiB; 

5 

Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min 
25 cycles at 94°C for 15 sec, 56°C for 15 sec and 72°C for 1.5 min 
Reactions were concluded at 72°C for 6 minutes. 

10 

Upon completion of thermal cycling reactions, the amplified DNA was washed 
and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, 
USA). The amplified DNA sample was subjected to digestion with the restriction 
endonucleases, BamHI and Xhol (New England BioLabs, Beverly, MA, USA) (Current 

1 5 Protocols in Molecular Biology, Ibid). The DNA as subjected to electrophoresis on a 
1.0 % NuSeive (FMC BioProducts, Rockland, ME USA) agarose gel. DNA was 
visualized by exposure to ethidium bromide and long wave uv irradiation. DNA 
contained in slices isolated from the agarose gel was purified using the Bio 101 
GeneClean Kit protocol (Bio 101 Vista, CA, USA) 

20 Cloning, transformation, expression and purification of the PPI gene was carried 

out essentially as described in Examples II and III above. 

Assay for PPIase activity 

The assay for PPIase was essentially as described by Fisher (Fischer, G., et.al. 

25 (1 984) Biomed. Biochim. Acta 43:1101-1111). The assay measures the cis-trans 

isomerization of the Ala-Pro bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p- 
nitroanilide (Sigma # S-7388, lot # 84H5805). The assay is coupled with a- 
chymotrypsin, where the ability of the protease to cleave the test peptide occurs only 
when the Ala-Pro bond is in trans. The conversion of the test peptide to the trans isomer 

30 in the assay is followed at 390 nm on a Beckman Model DU-650 spectophotometer. 
The data were collected every second with an average scanning of time of 0.5 second. 
Assays were carried out in 35 mM Hepes, pH 8.0, in a final volume of 400 ul, with 10 n 
M ot-chymotrypsin (type 1-5 from bovine Pancreas, Sigma # C-7762, lot 23H7020) and 
10 nM PPIase. To initiate the reaction, 10 |il of the substrate ( 2 mM N-Succinyl-Ala- 

35 Ala-Pro-Phe-p-nitroanilide in DMSO) was added to 390 ul of reaction mixture at room 
temperature. 
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PPiase assay in crude bacterial extract. 

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth was 
harvested at mid-log phase (OD 600 nm - 1 ) and resuspended in lysis buffer with the 
following protease inhibitors: 1 mM PMSF, and 10 ng/ml of each of aprotinin, 
5 leupeptin, pepstatine, TLCK, TPCK, and soybean trypsin inhibitor. Ther suspension 
was subjected to 3 cycles of freeze-thaw (15 minutes at -70 °C, then 30 minutes at room 
temperature), followed by sonication (three 20 second bursts). The lysate was 
centrifuged (12,000 g x 30 minutes) and the supernatant was assayed for PPiase activity. 

10 Results 

PPI from K pylori was expressed in E. coli using the pET-28b expression vector 
from Novagen (cat # 69868-1). The expressed recombinant protein was isolated from 
the soluble fraction of bacterial cells that had been disrupted by cavitation in a 
Microfluidics Cell disruption chamber. The expression levels of recombinant PPI 
1 5 produced 1 00 mg of protein. The recombinant protein could be purified to homogeneity 
by Ni 2+ chelate chromatography and gel filtration. On sodium dodecyl sulfate 
polyacrylamide gels, the recombinant protein migrates as a single band at 21 kDa, in 
accordance with the predicted molecular weight of 20,975 deduced from the gene 
sequence. 

20 The PPiase activity was assayed using the chromogenic tetrapeptide substrate 

succinyl-Ala-Ala-Pro-Phe-p-nitroanilide. An initial velocity of 4.9 jamole/min/mg 
protein was measured with the purified enzyme (Figure 5). This corresponds to a k cat of 
1 .6 sec _1 which is similar to the one obtained for the £. coli PPiase (Liu, J, and Walsh, 
C.T. (1990) Proc.NatL Acad Set USA 87:4028-4032) and the one from porcine kidney 

25 (Fischer, G. (1 989) Nature 337:476-478). 

9 -\ -I 

The recombinant protein has a high catalytic efficiency of 2.06 X 10 M s 
when the assay is measured at 25°C. These values are one to two orders of magnitude 
higher than that observed for other characterized PPIases. However, in those studies, the 
ppiase assay was conducted at 10 0 C, which may account for the discrepency. The 

8 9-1-1 

30 calalytic efficiency is very close to the 1 X 10 to 1 X 10 M s upper diffusinal limit 
for "kinetically perfect" enzymes (Albery, WJ. and Knowles, J.R. (1976) Biochemistry 
1 5:5631 -5640) and suggests that by at least one measure, the H. pylori PPiase is a highly 
effective catalyst in the cis-trans isomerisation of the Ala-Pro bond in the oligopeptide 
substrate. 
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The presence of PPIase was also determined in an H. pylori extract. As with the 
assay for the recombinant protein, PPIase activity was detected, and was dependent on 
the concentration of extract added (Figure 6). 

These results show that PPIase activity can be measured on either H. pylori 
5 extracts or on the recombinant protein in E. coli. The high catalytic efficiency also 

demonstrates that H. pylori enzymes, such as PPIase, can be expressed at high levels and 
in an active form in E. coli. Such high yields of purified proteins provide for the design 
of various high throughput drug screening assays. 

10 IX. Cloning, purification, and characterization of the gene encoding the glutamate 
racemase of H. pylori. 

The Helicobacter pylori genome contains an open reading frame (ORF) of 255 

amino acids that was found to have homology to the Staphylococcus haemolyticus 

glutamate racemase gene (dga) (NCBI Accession number U 12405) and to the E. coli 

15 murl gene which encodes glutamate racemase activity in that organism. To evaluate 

whether this H. pylori ORF encodes a protein with glutamate racemase activity, the gene 
was isolated by polymerase chain reaction (PCR) amplification cloning, overexpressed 
in E. coli, and the protein purified to apparent homogeneity. A simple assay for 
glutamate racemase activity resulting in the isomerization of D-glutamic acid to L- 

20 glutamic acid was developed to facilitate purification and for future use as a high- 
throughput drug screen. 

The ORF in H. pylori has been found by gene disruption studies to be essential 
for viability of H. pylori cells in laboratory culture (see Example VII above). Therefore, 
inhibition of the enzymatic activity would be expected to be lethal for the organism, and 

25 such inhibitors may have utility in antimicrobial therapy of human infectious diseases. 

Cloning ofK pylori murl gene encoding glutamate racemase 

A 765 base pair DNA sequence encoding the murl gene of//, pylori was isolated 
by polymerase chain reaction (PCR) amplification cloning. A synthetic oligonucleotide 

30 primer (5 ' - AAATAGTC ATATG A A AAT AGGCGTTTTTG -3' (SEQ ID NO:1289)) 
encoding an Ndel restriction site and the 5' terminus of the murl gene and a primer 
(5 * - AG AATTCT ATT AC A ATTTG AGCC ATTCT -3* (SEQ ID NO:1290)) encoding an 
EcoRI restriction site and the 3' end of the murl gene were used to amplify the murl 
gene of K pylori using genomic DNA prepared from the J99 strain of//, pylori as the 

35 template DNA for the PCR amplification reactions (Current Protocols in Molecular 

Biology, John Wiley and Sons, Inc. F. Ausubel et al., editors, 1994). To amplify a DNA 
sequence containing the murl gene, genomic DNA (25 nanograms) was introduced into 
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each of two reaction vials containing 1 .0 micromole of each synthetic oligonucleotide 
primer, 2.0 mM MgCI 2 , 0.2 mM of each deoxy nucleotide triphosphate (dATP, dGTP, 
dCTP & dTTP), and 1.25 units of heat stable DNA polymerases (Amplitaq, Roche 
Molecular Systems, Inc., Branchburg, NJ„ USA) in a final volume of 50 microliters. 
5 The following thermal cycling conditions were used to obtain amplified DNA products 
for the murl gene using a Perkin Elmer Cetus/ GeneAmp PCR System 9600 thermal 
cycler: 

Conditions for amplification of H. pylori murl; 

10 

Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 15 sec, 30°C for 30 sec and 72°C for 15 sec 
23 cycles at 94°C for 15 sec, 53°C for 30 sec and 72°C for 15 sec 
Reactions were concluded at 72°C for 20 minutes 

15 

Upon completion of thermal cycling reactions, the amplified DNA was washed 
and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, 
USA). The amplified DNA sample was subjected to digestion with the restriction 
endonucleases, Ndel and EcoRI (New England Biolabs, Beverly, MA USA) (Current 

20 Protocols in Molecular Biology, Ibid). The DNA samples from each of two reaction 
mixtures were pooled and subjected to electrophoresis on a 1 .0% SeaPlaque (FMC 
BioProducts, Rockland, ME, USA) agarose gel. DNA was visualized by exposure to 
ethidium bromide and long wave uv irradiation. Amplified DNA encoding the K pylori 
murl gene was isolated from agarose gel slices and purified using the Bio 101 

25 GeneClean Kit protocol (Bio 101 Vista, CA, USA). 

Cloning ofK pylori DNA sequences into the pET-23 prokaryotic expression 
vector. 

The pET-23b vector can be propagated in any E. coli K-12 strain, e.g., HMS174, 
HB101 , JM109, DH5a, etc., for the purpose of cloning or plasmid preparation. Hosts 
for expression include E. coli strains containing a chromosomal copy of the gene for T7 
RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative 
that carries the lad gene, the lacUV5 promoter and the gene for T7 RNA polymerase. 
T7 RNA polymerase is induced by addition of isopropyl-B-D-thiogalactoside (IPTG), 
and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying 
its gene of interest. Strains used in our laboratory include: BL21(DE3) (Studier, F.W., 
Rosenberg, A.H., Dunn, J.J., and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89). 

The pET-23b vector (Novagen, Inc., Madison, WI, USA) was prepared for 
cloning by digestion with Ndel and EcoRI (Current Protocols in Molecular Biology, 
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Ibid). Following digestion, the amplified, agarose gel-purified DNA fragment carrying 
the murl gene was cloned (Current Protocols in Molecular Biology, Ibid) into the 
previously digested pET-23b expression vector. Products of the ligation reaction were 
then used to transform the BL21(DE3) strain of E. coli. 

5 

Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E coli strain BL21 or E. coli strain BL21(DE3), were 
transformed with recombinant pET23-mwr/ expression plasmid carrying the cloned H. 
pylori sequence according to standard methods (Current Protocols in Molecular, Ibid). 

10 Briefly, 1 microliter of ligation reaction was mixed with 50 microliters of 

electrocompetent cells and subjected to a high voltage pulse, after which, samples were 
incubated in 0.45 milliliters SOC medium (0.5% yeast extract, 2.0 % tryptone, 10 mM 
NaCl, 2.5 mM KC1, 10 mM MgCI 2 , 10 mM MgS0 4 and 20 mM glucose) at 37°C with 
shaking for 1 hour. Samples were then spread on LB agar plates containing 100 

15 microgram/ml ampicillin for growth overnight. Transformed colonies of BL21 were 
then picked and analyzed to evaluate cloned inserts as described below. 

Identification of recombinant pET expression plasmids carrying H. pylori 
sequences 

20 Individual BL21 clones transformed with recombinant pET-23-mw/7 were 

analyzed by PCR amplification of the cloned inserts using the same forward and reverse 
primers, specific for each K pylori sequence, that were used in the original PCR 
amplification cloning reactions. Successful amplification verified the integration of the 
H. pylori sequences in the expression vector (Current Protocols in Molecular Biology, 

25 Ibid). 

Isolation and Preparation of plasmid DNA from BL21 transformants 

Colonies carrying pET-23-/wwr/ vectors were picked and incubated in 5 mis of 
LB broth plus 100 microgram/ml ampicillin overnight. The following day plasmid 
30 DNA was isolated and purified using the Qiagen plasmid purification protocol (Qiagen 
Inc., Chatsworth, CA, USA). 

Cloning and expression of the E. coli groE operon 

It has been demonstrated that coexpression of the E. coli murl gene with the 
35 genes in the E. coli groE operon reduces the formation of insoluble inclusion bodies 
containing recombinant glutamate racemase (Ashiuchi, M., Yoshimura, T., Kitamura, 
T., Kawata, Y., Nagai, J., Gorlatov, S., Esaki, N. and Soda, K., 1995, J, Biochem. 117, 
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495-498). The groE operon encodes two proteins, GroES (97 amino acids) and GroEL 
(548 amino acids), which are molecular chaperones. Molecular chaperones cooperate to 
assist the folding of new polypeptide chains (F. Ulrich Hartl , 1996, Nature London 381, 
pp. 571-580). 

5 The 2210 bp DNA sequence encoding the groE operon of E. coli (NCB1 

Accession number X07850) was isolated by polymerase chain reaction (PCR) 
amplification cloning. A synthetic oligonucleotide primer (5*-GCGAATTCGATCAG 
A ATTTTTTTTCT-3 * (SEQ ID NO: 1 291 )) encoding an EcoRI restriction site and the 5 * 
terminus of the groE operon containing the endogenous promoter region of the groE 

10 operon and a primer (5 '-ATAAGTACTTGTG AATCTTATACTAG -3' (SEQ ID 

NO: 1292)) encoding a Seal restriction site and the 3' end of the groEL gene contained in 
the groE operon were used to amplify the groE operon of E. coli using genomic DNA 
prepared from E. coli strain MG1655 as the template DNA for the PCR amplification 
reactions (Current Protocols in Molecular Biology, Ibid). To amplify a DNA sequence 

1 5 containing the E. coli groE operon, genomic DNA (12.5 nanograms) was introduced 
into each of two reaction vials containing 0.5 micromoles of each synthetic 
oligonucleotide primer, 1.5 mM MgCl 2 , 0.2 mM each deoxynucleotide triphosphate 
(dATP, dGTP, dCTP & dTTP) and 2.6 units heat stable DNA polymerases (Expanded 
High Fidelity PCR System, Boehringer Mannheim, Indianapolis, Indiana) in a final 

20 volume of 50 microliters. The following thermal cycling conditions were used to obtain 
amplified DNA products for the groE operon using a Perkin Elmer Cetus/ GeneAmp 
PCR System 9600 thermal cycler: 

Conditions for amplification and cloning of the E, coli groE operon; 

25 

Denaturation at 94°C for 2 min, 

2 cycles at 94°C for 1 5 sec, 30°C for 30 sec and 72°C for 2 min 
23 cycles at 94°C for 15 sec, 55°C for 30 sec and 72°C for 2 min 
Reactions were concluded at 72°C for 8 minutes 

30 

Upon completion of thermal cycling reactions, the amplified DNA was washed 
and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, 
USA). The amplified DNA sample was subjected to digestion with the restriction 
endonucleases, EcoRI and Seal (New England Biolabs, Beverly, MA USA) (Current 
35 Protocols in Molecular Biology, Ibid). The DNAs from each of two reaction mixtures 
were pooled and subjected to electrophoresis in a 1.0% SeaPlaque (FMC BioProducts, 
Rockland, ME, USA) agarose gel. DNA was visualized by exposure to ethidium 
bromide and long wave uv irradiation. DNA contained in slices isolated from the 
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agarsoe gel was purified using the Bio 101 GeneClean Kit protocol (Bio 101 Vista, CA, 
USA). 

A DNA fragment, EcoRI to Seal, containing the E. coli groE operon was cloned 
into the corresponding sites of the pACYC184 expression vector (New England Biolabs, 
5 Beverly, MA, USA) to make pACYC184-gro£;. The BL21(DE3) strain of E. coli was 
transformed with pACYC-groE. A tetracycline-resistant transformant overexpressing 
proteins of M r ~ 14,000 (GroES) and M r ~ 60,000 (GroEL) was isolated. 

Transformation of E. coli strain BL21(DE3) carrying the pACYC-groE plasmid 
10 ofE. coll 

Competent bacteria derived from a clone of strain BL21(DE3) carrying the 
pACYC-groE plasmid were transformed with 50 nanograms of pET23-murI plasmid 
DNA, isolated as described above (Current Protocols in Molecular Biology, Ibid). A 
clone of BL21(DE3) carrying both the pACYC-groE expression plasmid and the pET- 
15 23-murI plasmid was isolated and used for expression of recombinant glutamate 
racemase as described below. 

Expression of recombinant H. pylori murl 

A bacterial clone of BL21(DE3) carrying both the pACYC-groE expression 

20 plasmid and the pET-23-mw/7 plasmid was cultured in LB broth supplemented with 1.0 
mM D,L-glutamic acid and 100 microgram/ml ampicillin and 10 micrograms/ml 
tetracycline at 30°C until an optical density at 600 nM of 0.5 to 1 .0 O.D. units was 
reached, at which point, isopropyl-beta-D-thiogalactoside (IPTG) was added to the 
culture at a final concentration of 1.0 mM. Cells were cultured overnight to induce gene 

25 expression of the H. pylori recombinant DNA constructions . 

After induction of gene expression with IPTG, bacteria were pelleted by 
centrifugation in a Sorvall RC-3B centrifuge at 3000 x g for 20 minutes at 4°C. Pellets 
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 
mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C. 

30 Pellets were weighed (average wet weight = 6 grams/liter) and processed to purify 
recombinant protein as described below. 

Purification of soluble glutamate racemase 

All steps were carried out at 4°C. Cells were suspended in 4 volumes of lysis 
35 buffer (50 mM Potassium phosphate, pH 7.0, 100 mM NaCl, 2 mM EDTA ? 2 mM 

EGTA, 10% glycerol, 10 mM D,L-glutamic acid, 0.1 % p-mercaptoethanol, 200 ml 
lysozyme, 1 mM PMSF, and 10 ug/ml each of leupeptin, aprotinin, pepstatin, L-l- 
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chloro-3-[4-tosylamidoj-7-amino-2-heptanone (TLCK), L- 1 -chloro-3-[4-tosylamido]-4- 
phenyI-2-butanone (TPCK), and soybean trypsin inhibitor, and ruptured by three 
passages through a small volume microfluidizer (Model M-l 10S, Microfluidics 
International Corporation, Newton, MA). The resultant homogenate was diluted with 1 
5 volume of buffer A (10 mM Tris-HCl pH 7.0, 0.1 mM EGTA, 10 % glycerol, 1 mM DL- 
Glutamic acid, 1 mM PMSF, 0.1% beta-mercaptoethanol), made 0.1 % Brij-35, and 
centrifuged (100,000 x g, 1 h) to yield a clear supernatant (crude extract). 

After filtation through a 0.80-um filter, the extract was loaded directly onto a 20 
ml Q-Sepharose column pre-equilibrated in buffer A containing 100 mM NaCl and 0.02 
10 % Brij-35. The column was washed with 100 ml (5 bed volumes) of Buffer A 

containing 100 mM NaCl and 0.02 % Brij-35, then developed with a 100-ml linear 
gradient of increasing NaCl (from 100 to 500 mM) in Buffer A. A band of M r = 28,000 
corresponding to glutamate racemase, the product of the recombinant H. pylori murl 
gene, eluted at a gradient concentration of approximately 200-280 mM NaCl, Individual 
1 5 column fractions were then characterized for glutamate racemase activity (see below for 
description of assay) and the protein profile of the fractions were analyzed on 12 % 
acrylamide SDS-PAGE gels. 

Fractions containing glutamate racemase were pooled, brought to 70% saturation 
with solid (NH4) 2 S0 4 , stirred for 20 min, and then centrifuged at 27,000 x g for 20 min. 
20 The resulting pellet was resuspended in lysis buffer to a final volume of 8 ml and loaded 
directly onto a 350-ml column (2.2 x 92 cm) of Sephacryl S-100HR gel filtration 
medium equilibrated in buffer B (10 mM Hepes pH 7.5, 150 mM NaCl, 0.1 mM EGTA, 
10% glycerol, 1 mM D,L-glutamatic acid, 0.1 mM PMSF, 0.1 % beta-mercaptoethanol) 
and run at 30 ml/h. Fractions found to contain a glutamate racemase activity were 
25 pooled, and 0.5 volume of buffer C (10 mM Tris pH 7.5, 0.1 mM EGTA, 10% glycerol, 
1 mM D,L-gIutamic acid, 0.1 mM PMSF, 0.1 % B-mercaptoethanol)was added (to 
reduce the NaCl concentration to 100 mM), and loaded onto a MonoQ 10/10 high- 
pressure liquid chromatography column equilibrated in buffer C containing 100 mM 
NaCl. The column was washed with 5 bed volumes of this buffer and developed with a 
30 40 ml linear gradient of increasing NaCl (from 100 to 500). Glutamate racemase eluted 
as a sharp peak at 3 10 mM NaCl. Fractions containing a glutamate racemase activity 
were pooled, concentrated by dialysis against storage buffer [50% glycerol, 10 mM 3- 
(N-morpholino-propanesulfonic acid (MOPS) pH 7.0, 150 mM NaCl, 0.1 mM EGTA, 
0.02 % Brij-35, 1 mM dithiothreitol (DTT)], and stored at -20°C. 

35 
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Assays for glutamate racemose activity. 

Conversion of D-glutamate to L-elutamate (two enzyme coupled assay) 

The activity of glutamate racemase, interconversion of the enantiomers of 
5 glutamic acid, was measured using D-glutamic acid as substrate. The method of Gallo 
and Knowles (Gallo, K.A. and Knowles, J.R., 1993, Biochmistry 32, 3981-3990) that 
was used to measure the glutamate racemase activity of Lactobacillus fermenti was 
adapted for the measurement of glutamate racemase activity of the H. pylori murl gene 
product isolated as a recombinant protein from E. coli. In this assay, the measurement 

10 of the activity of glutamate racemase is linked to an OD change in the visible range in a 
series of coupled reactions to the activities of L-glutamate dehydrogenase (reduction of 
NAD to NADH) and diaphorase (reduction of the dye p-iodonitrotetrazolium violet, 
INT). Initial rates were determined by following the increase in absorbance at 500 nm 
in a reaction volume of 200 fxl containing 50 mM Tris-HCl, pH 7.8, 4% v/v glycerol, 10 

15 mM NAD, 2 mM INT, 60 Units/ml L-glutamate dehydrogenase, 5 Units/ml diaphorase, 
and varying concentrations of either substrate (from 0.063 mM to 250 mM D-glutamic 
acid) or purified enzyme (from 1/jg to 50 ^ig). After a preincubation of all reagents 
except either the substrate (D-glutamic acid) or the enzyme (murl gene product) for a 
period of 5 minutes, reactions were initiated by adding the missing ingredient (i.e., the 

20 enzyme or the subtrate, as required), and the increase in optical density at 500 nm was 
measured in a Microplate Spectophotometer System (Molecular Devices, Spectra MAX 
250). Measurements were followed for 20 minutes, and initial velocities were derived 
by calculating the maximum slope for the absorbance increases. The coupled reaction 
can be summarized as shown below: 

25 

1) D-glutamate -> L-glutamate 

glutamate racemase 

2) L-glutamate + H 2 0 + NAD + -> 2-oxoglutarate + NH 3 + NADH 
30 L-glutamate dehydrogenase 

3) NADH + INT NAD* + formazan (color) 

diaphorase 

35 Conversion of D-glutamate to L-glutamate (single enzyme coupled assays 

In this assay, the conversion of D-glutamic acid to L-glutamic acid is coupled to 
the conversion of L-glutamic acid and NAD* by L-glutamate dehydrogenase to 2- 
oxoglutarate, ammonia. The production of NADH is measured as an increase of 



BNSDOCID: <WO 9737044A1 I > 



BNS oaae 129 



WO 97/37044 



PCTYUS97/05223 



- 128 - 

absorbance at 340 nm (the reduction of NAD" to NADH) at 37°C. The standard assay 
mixture (adapted from Choi, S-Y„ Esaki, N., Yoshimura, T., and Soda, K., 1991 , 
Protein Expression and Purification 2, 90-93) contained 10 mM Tris-HCl, pH 7.5, 5 mM 
NAD+, 5 Units/ml L-glutamate dehydrogenase, varying concentrations of the substrate 
5 D-Glutamic Acid (0.063 mM to 250 mM), and the purified recombinant H. pylori 

enzyme glutamate racemase (1 |ig to 50 jig). The reaction was started by the addition of 
either the substrate D-glutamic acid or the recombinant glutamate racemase after a 
preincubation at 37°C for 5 minutes with all of the other assay ingredients. The change 
in absorbance at 340 nm was measured in a Spectra MAX 250. Initial velocities were 
10 derived from the initial slopes. The coupled reactions can be summarized as shown 
below: 

1) D-glutamate — > L-glutamate 

glutamate racemase 

15 

2) L-glutamate + H 2 0 + NAD + -> 2-oxoglutarate + NH 3 + NADH 

L-glutamate dehydrogenase 

20 Results 

1) Expression of the K pylori murl gene in E. coli cells 

To examine its biochemical properties, the H. pylori glutamate racemase was 
overexpressed in E. coli and purified. In the presence of the E. coli chaperones GroES 

25 and GroEL, the glutamate racemase was expressed as a soluble protein. About 20 mg of 
soluble Murl was produced per liter of culture as judged by intensity of the protein band 
after SDS-PAGE. No band corresponding to the molecular weight of murl protein was 
seen in control gel lanes containing extracts from cells transformed with the pET vector 
lacking a murl insert. Addition of 1 mM DL-glutamic acid during cultivation of the 

30 expressing cells increased the apparent expression level by about five-fold. 

2) Purification of recombinant H. pylori murl protein 

Murl was purified by cation exchange chromatography and gel filtration. Upon 
SDS-PAGE analysis, the purified protein migrated as a single polypeptide species with 
35 an apparent mass 29 kDa which is consistent with the predicted mass of 28,858. 
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3) Kinetic properties of recombinant H. pylroi murl enzyme 

Kinetic constant for recombinant glutamate racemase were estimated by assaying 
its activity at various concentrations of protein and D-glutamic acid as described above. 
Purified recombinant H. pylori glutamate racemase exhibits a V max of - 300 
5 nmoles/min/mg protein (kcat = 8.6 min -1 ) and a K m of -1 00 fiM for D-glutamate. 
Although the Vmax value is lower than that observed for highly purified glutamate 
racemase from some other bacterial species, its Km for D-glutamic acid is higher than 
that observed for the enzyme from most other species, resulting in a catalytic efficiency 
(kcat/Km) which is typical of purified preparation from E. coli and P. Pentococcus. 

10 

4) Characterization of Murl: Inhibition by L-serine-O sulfate 

The H. pylori glutamate racemase was tested for inactivation with a sucuide 
inhibitor, L-serine-O sulfate, which is known to inhibit murl from E. coli. The enzyme 
was incubated in the presence of 20 mM L-serine-O sulfate, and at different times 

15 interval, aliquots were removed to determine residual activity. The initial velocity of 
purified recombinant H. pylroi murl protein was determined in the single enzyme 
coupled asssay following incubation with the inhibitor L-serine-O-sulfate (LSOS) at 20 
mM for the times indicated on the x-axis. The control was incubated in an identical 
manner but without LSOS. As shown in Figure 7, the H. pylori glutamate racemase can 

20 be readily inactivated by the inhibitor. 

Future application of the glutamate racemase activity in high throughput drug 
screening assays. 

The assays for measurement of H. pylori glutamate racemase activity described 
25 above have been carried out in 96-well plates in which multiple reactions were 

conducted simultaneously. Measurements of activity in a multi-well format are readily 
amenable to scale-up to permit rapid analysis of numerous compounds for inhibition of 
the glutamate racemase activity. Compunds which inhibit the activity of glutamate 
racemase may have application as novel antibiotics and may be suitable for the treatment 
30 and eradication of bacterial (e.g., //. pylori) infections in humans. Known inhibitors of 
glutamate racemase, such as L-serine-O-sulfate, can be used to calibrate high throughput 
screens of new compound libraries to facilitate identification of new compounds with 
properties suitable for in vivo human therapeutics. 
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EOUIVALENTS 

Those skilled in the art will recognize, or be able lo ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments and methods 
described herein. Such equivalents are intended to be encompassed by the scope of the 
5 following claims. 
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TTAAAAAAAG CGTTTAAGGA GAGGTTTTGC TCTCAAGTGT ATATCTCTTT TAATGTGGAT 12 0 

CACAATCTTT TATCCGCTCA AGTTT T AAGG GTTAAAAACC ACCGCATCAA AGAGAAATTT 130 

TTTAAAACTT TTGAGACTAA AGTGGAAACT AAAAATGGTG AAGTCCCCAT TCAAGCCTTA 240 

AAAATCGCCA GAACTTATAG CCAAAAATAC CCCTACACCT ATTTTAGCGC GATGAGTAAA 3 00 

GCTAAAGAGG TTTTATGCGA AAAGCAGGCG TTTGAACAAA TCAAACAAGA AAATCAAGAT 3 60 

TATCAGGCTT GTGAAGTCAA TCAAAAGTAT TGCGTTTATG TGGAATCTAA GGATTTTCTA 4 20 

AAGG A TTTC A AGCGTTTTAA AATCCAGGAT GTGGATTTTT TGTTTTCGCC CTTTAGCCTT 4 80 

ATTTATGATT TTGTGCGCGA TAATTTAGAA AACAAGCCGT TGTTGTATCT GCTTTTAGAG 540 

CGTTCAAGGT TTTATTTTTT GATTGCGGAT AAAAAAGAGA TTTTTTTAGC CAAATCCGTG 600 

TTTTTAGAAG AGCAACCTGA AGAGTTTATA GAGAGCAAAG AAGAAGATTC TATGGGAATG 6 60 

GATAATGAAG CTGTGGATTT GTTTTTGAGT GAAATCCAAG AAGATATTGA CAGCCTTGAA 7 20 

GAAGCGATAG GCCTAGACAG CAGCAAGGAT AATAGCGAAA AAATAACAGA GGACGCTTAT 7 80 

AGTTTGATTG AAGGCATGAC GAATATCCCC TTGATTGCAG ATGTTTTGCA AGAGGGATTG 840 

CGTGGCGTCT ATCATTCTAG AGAGATAGAC TTTGTAGAAA AAGTGGTTGT TTTAGACAGC 900 

TGTCAAATCC ACCACAAGGC GTTAATGCAT TTGCAAGAAA CTTTGATGAT AGAAGTGGAT 960 

AGGCTCGATT TTTCTTTAGT GGAGCGGTTG AATGTTTTAG CGCGCATGGA GAATGAAAAG 102 0 

C ATGCGTTT 1029 

(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 60 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_f ea ture 

(B) LOCATION 1 . . .1260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 227: 

ATGATTTTTG GGGATTTTAA ATATCAAAAA AGCGTTAAAA AACTCACAGC CACTAACCTT 60 

AACGAGCTTA AAAACGCCCT GGATTTTATC TCTCAAAATA GGGGGAAGGG GTATTTTGTG 12 0 

GGGTATCTTT TGTATGAAGC GCGCTTAGCG TTTTTAGATG AAAATTTTCA AAGCCAAACC 180 

CCTTTTTTGT ATTTTGAACA ATTTTTAGAA AGAAAAAAAT ACCTTGTAGA GCATCTAAAA 2 40 

GAGCATGCGT TTTACCCTAA AATCCACAGC TCTTTAGATC AAAAAACTTA TTTCAAGCAG 3 00 

TTTAAGGCCG TTAAAGAGCA TCTCAAAAAC GGCGATACCT ATCAAGTGAA TTTGACAATG 3 60 

GATTTATTGT TAAACACTAA GGCCAAACCA AAGCGCGTTT TTAAGGAAGT GATACACAAC 420 

CAAAACACGC CTTTTAAGGC TTTTATAGAA AATGAGTTTG GGAGCGTTTT AAGCTTTTCG 4 80 

CCGGAATTGT TTTTTGAATT AGAGTTTTTA GACACAGCGA TTAAGATTAT CACAAAACCC 540 

ATG AAAGGC A CGATCGCTCG CTCAAACAAC CCCTTGATAG ATGAAAAAAA CCGATTGTTT 600 

TTGCAAAATG ATGACAAAAA TAGAAGCGAA AATGTGATGA TCGTGGATTT ATTGCGTAAC 660 

GATTTGAGCC GCTTGGCCTT AAAAAATAGC GTGAAAGTCA ATCAATTGTT TGAAATCATC 720 

AGTTTGCCGA GCGTGTATCA AATGATAAGC GAGATTGAAG CTCAATTGCC CCTAAAAACA 7 80 

AGCTTGTTTG AGATTTTTAA GGCGTTGTTC CCTTGCGGCT CTGTGACCGG ATGCCCTAAA 840 

ATCAAAACCA TGCAAATCAT TGAAAGTTTA GAAAAACGCC CTAGGGGGGT GTATTGTGGG 9 00 

GCGATAGGCA TGGTTGGAGG AAAAAAAGCC CTTTTTAGCG TGCCTATCCG CACTTTAGAA 960 
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AAAAGAGCGC ATGAAGATTT TTTGCATTTA GGGGTAGGGA GTGGGGTAAC TTATAAAAGT 
AAAGCTTCAA AGGAATATGA AGAGAGCTTT TTAAAATCCT TTTTTGTGAT GCCCAAAATA 
GAATTTGAGA TCGTGGAAAC GATGAGAGTT ATCAAAAGGG ATC AAAAATT AGAGATTAAC 
AATAAAAACG CCCATAAAGA ACGCTTAATG CATAGCGCCC AATATTTTAA CTTTAATACA 
GATGAAAATC TTTTAGACTT TGAATTAGAA GGAGAAGGGG TTTTAAGGGT TTTAATTCAA 



(2) INFORMATION FOR SEQ ID NO: 228: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME / KEY : mi sc_ feature 

(B) LOCATION 1...855 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 8: 

AAACAAATAA GAAAAGAACG CTTGAACAAA CTGCTTAAAA AGGGGTTTTT AGCGTTCTTT 60 

TTAAGCGTGT ATTTAAGGGC TGATGATCTG GTTACTTATA CGATCGCCAA AGAAGAAGAT 12 0 

CTAGGGTACC AGCGGTTTTT AGCCAAGAAG TGTTTAAGGG GTAAAACCCA CCCCCCATGT 180 

TTTACCGCGT CTAAAAAGCC CAAGAGGAAA CCTTTTAATA TAGATAAAAG CTCCCATTAT 240 

TATGGCACGA GCGTGGTGCA AATGTCATGG CTACAGAGTA GGG AAAAATT TGAAAACCAT 300 

TCAAAATACC GAAACATTCC TTTTGCTGAA GTCAGTTTGA TTTATGGCTA TAAACAATTT 360 

TTTCCTAAAA AAGAGCACTA CGGCTTCCGT TTTTATGTCT C TTTGG ATT A CGCTTATGGC 42 0 

TTTTTTCTTA AAAATAAAGG CGCATTAGGC GATAGTTTGA GGGCGAGTTC GCAGATCCCT 4 80 

AAAAGCTATA GAG AAAAATT ACAAAGAAAA GAGACTTTTA TTAACGCTAT TTTTTATGGC 540 

GTGGGGGCTG ACTTTTTATA CAAACGCGCT TTTGGGACGC TGATTTTAGG GGTGAATTTC 600 

GTGGGAGAAA CCTGGTTTTA TGAAACAAAG ATTTTCAAAC AATGGGCTAA AGACTCTCTA 6 60 

AATACTTACC GCCCCAACAT GTTTCAAGTG ATGTTGAATG TGGGGTATCG CTACCGCTTT 720 

TCAAGGTATA AGAATTGGGC GATAGAATTT GGCGCGCGCA TCCCCTTTTT AATTAATGAT 7 80 

TATTTTAAAA CCCCTTTATA CACCCTTCAT TTCAAACGCA AT ATTTC TGT CTATCTCACT 84 0 

TCAACTTATG ATTTT 855 



(2) INFORMATION FOR SEQ ID NO: 229: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) H Y POTHET I C AL ; NO 
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GTCAAATACG AAGATAAGGG CGAACCTCTC AAAATCACAT GGCATTCTAA TTGCCATGCC 4 20 

TTAAGGGTGG CTAAAGTGAT TGACTCGGCG AAAAATCTCA TCAGACAGCT TAAAAATGTG 480 

GAACTCATTG AATTGGAAAA AGAAGAAGAA TGCTGCGGGT TTGGGGGGAC TTTTTCAGTT 54 0 

AAAGAGCCTG AAATTTCAGC GGTTATGGTC AAAGAAAAGA TTAAAGACAT AGAGAGCCGT 600 

CATGTGGATG TGATCGTTTC AGCGGATGCT GGGTGCTTGA TGAATATCAG CACCGCTATG 6 60 

CAAAAAATGG GCTCTTTGAC AAAACCCATG CATTTTTATG AC TTTTT AGC CTCAAGGCTT 72 0 

GGACTTTAA 72 9 

(2) INFORMATION FOR SEQ ID NO: 3 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc„feature 

(B) LOCATION 1...1206 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 45: 

ATGATTTTTG GGGATTTTAA ATATCAAAAA AGCGTTAAAA AACTCACAGC CACTAACCTT 60 

AACGAGCTTA AAAACGCCCT GGATTTTATC TCTCAAAATA GGGGGAAGGG GTATTTTGTG 120 

GGGTATCTTT TGTATGAAGC GCGCTTAGCG TTTTTAGATG AAAATTTTCA AAGCCAAACC 180 

CCTTTTTTGT ATTTTGAACA ATTTTTAGAA AGAAAAAAAT ACCCTTTAGA GCCTTTAAAA 240 

GAGCATGCGT TTTACCCTAA AATCCACAGC TCTTTAGATC AAAAAACTTA TTTCAAGCAG 3 00 

TTTAAGGCCG TTAAAGAGCA TCTCAAAAAC GGCGATACCT ATCAAGTGAA TTTGACAATG 360 

GATTTATTGT TAAACACTAA AGCCAAACCA AAGCGCGTTT TTAAGGAAGT GATACACAAC 420 

CAAAACACGC CTTTTAAGGC TTTTATAGAA AATGAGTTTG GGAGCGTTTT AAGCTTTTCG 4 80 

CCGGAATTGT TTTTTGAATT AGAGTTTTTA GACACAGCGA TTAAGATTAT CACAAAACCC 540 

ATGAAAGGCA CGATCGCTCG CTCAAACAAC CCCTTGATAG ATGAAAAAAA CCGATTGTTT 600 

TTGCAAAATG ATGACAAAAA TAGAAGCGAA AATGTGATGA TCGTGGATTT ATTGCGTAAC 6 60 

GATTTGAGCC GCTTGGCCTT AAAAAATAGC GTGAAAGTCA ATCAATTGTT TGAAATCATC 720 

AGTTTGCCGA GCGTGTATCA AATGATAAGC GAGATTGAAG CTCAATTGCC CCTAAAAACA 7 80 

AGCTTGTTTG AGATTTTTAA GGCGTTGTTC CCTTGCGGCT CTGTGACCGG ATGCCCTAAA 840 

ATCAAAACCA TGCAAATCAT TGAAAGTTTA GAAAAACGCC CTAGGGGGGT GTATTGTGGG 900 

GCGATAGGCA TGGTTGGAGG AAAAAAAGCC CTTTTTAGCG TGCCTATCCG CACTTTAGAA 9 60 

AAAAGAGCGC ATGAAGATTT TTTGCATTTA GGGGTAGGGA GTGGGGTAAC TTATAAAAGT 102 0 

AAAGCTTCAA AGGAATATGA AGAGAGCTTT TTAAAATCCT TTTTTGTGAT GCCCAAAATA 10 80 

GAATTTGAGA TCGTGGAAAC GATGAGAGTT ATCAAAAGGG ATCAAAAATT AGAGATTAAC 1140 

AATAAAAACG CCCATAAAGA ACGCTTAATG CATAGCGCCC AATATTTTAA CTTTAAATAC 12 00 

AGATGA 1206 

(2) INFORMATION FOR SEQ ID NO: 3 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 



BNSDQCID: <WO 9737044A1 I > 



WO 97/37044 

909 



PCT/US97/05223 



720 



GAAAATAGGG ATAATGTGGA AAAACAAGCG ATTAGAGATC CTAATATCAA AGAATTCGCA 

TGCGGGAAGT GGGTTTATGA TGATGAAAAT TTACAAGCCT ATCGCCCAAG CATTTTAAAA 7 80 

CGCGTTGATA AAGAC AAAG A GATTACAACA GACATTACCC CTTGCGATTA CAGCACCGCT 84 0 

GAAAATAAAA GCGGTAAAAT CATTACCCCC TATACTAAAA TCTCTGTTCA TAAAACAGAG 900 

CCTTTAGAAG ACCCTCAAAC TTTTGAAGCT AAAAATAATT TCGCCATTCT CCAAGCCAGA 9 60 

AGCTCTACAG AAAAGTGTAA AAGGGCTAGA GCGAGAAAAG ACGGCACGAC CAGGCAATGC 1020 

TATCTGATAG AAGAGCCTTT AAAACAAGCG TGGGAGAGCG AGTATGAAAT CACCACGCAA 1080 

TTAGTGAAAG CCATATATGA ACGCCCCAAA CAAGACGATC AAGTAGAGCC GACTTTTTAT 1140 

GAAACCAGCG AATTGGCTTA TTCTTCCACA CGAAAAAGCG AAATAACGCG TAATGAATTG 1200 

AATTTGAATG AAAAATTCAT GGAATTTGTG GAAGTGTATG AGGGGCATTA TTTAAACGAT 1260 

ATAATCAAAG AGAGCAGCGA ATACAAAGAA TGGGTTAAAA ACCATGTGCG CTTTAAAGAA 1320 

GGGGTGTGC A TGGCCTTAGA AATAGAAGAG CAACCACGAG CCAAAAGCAC GCCTTTGAGT 13 80 

ATTGAAAACT CTCGTGTTGT GTGTGTCAAA AAGGGGAATT ATTTATTCAA TGAAGTTTAA 144 0 

(2) INFORMATION FOR SEQ ID NO: 1003: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1704 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc__f eature 

(B) LOCATION 1. . .1704 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003: 

ATGATTTTTG GGGATTTTAA ATATCAAAAA AGCGTTAAAA AACTCACAGC CACTAACCTT 60 

AACGAGCTTA AAAACGCCCT GGATTTTATC TCTCAAAATA GGGGGAAGGG GTATTTTGTG 120 

GGGTATCTTT TGTATGAAGC GCGCTTAGCG TTTTTAGATG AAAATTTTCA AAGCCAAACC 180 

CCTTTTTTGT ATTTTGAACA ATTTTTAGAA AGAAAAAAAT ACCCTTTAGA GCCTTTAAAA 240 

GAGCATGCGT TTTACCCTAA AATCCACAGC TCTTTAGATC AAAAAACTTA TTTCAAGCAG 3 00 

TTTAAGGCCG TTAAAGAGCA TCTCAAAAAC GGCGATACCT ATCAAGTGAA TTTGACAATG 360 

GATTTATTGT TAAACACTAA AGCCAAACCA AAGCGCGTTT TTAAGGAAGT GATACACAAC 420 

CAAAACACGC CTTTTAAGGC TTTTATAGAA AATGAGTTTG GGAGC GTTTT AAGCTTTTCG 480 

CCGGAATTGT TTTTTGAATT AGAGTTTTTA GACACAGCGA TTAAGATTAT CACAAAACCC 540 

ATGAAAGGCA CGATCGCTCG CTCAAACAAC CCCTTGATAG ATGAAAAAAA CCGATTGTTT 600 

TTGCAAAATG ATGACAAAAA TAGAAGCGAA AATGTGATGA TCGTGGATTT ATTGCGTAAC 660 

GATTTGAGCC GCTTGGCCTT AAAAAATAGC GTGAAAGTCA ATCAATTGTT TGAAATCATC 720 

AGTTTGCCGA GCGTGTATCA AATGATAAGC GAGATTGAAG CTCAATTGCC CCTAAAAACA 780 

AGCTTGTTTG AGATTTTTAA GGCGTTGTTC CCTTGCGGCT CTGTGACCGG ATGCCCTAAA 840 

ATCAAAACCA TGCAAATCAT TGAAAGTTTA GAAAAACGCC CTAGGGGGGT GTATTGTGGG 900 

GCGATAGGCA TGGTTGGAGG AAAAAAAGCC CTTTTTAGCG TGCCTATCCG CACTTTAGAA 960 

AAAAGAGCGC ATGAAGATTT TTTGCATTTA GGGGTAGGGA GTGGGGTAAC TTATAAAAGT 1020 

AAAGCTTCAA AGGAATATGA AG AGAG CTTT TTAAAATCCT TTTTTGTGAT GCCCAAAATA 1080 

GAATTTGAGA TCGTGGAAAC GATGAGAGTT ATCAAAAGGG ATCAAAAATT AGAGATTAAC 1140 

AATAAAAACG CCCATAAAGA ACGCTTAATG CATAGCGCCC AATATTTTAA C TTT AAAT AC 1200 
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GATGAAAATC TTTTAGACTT TGAATTAGAA AAAGAAGGGG TTTTAAGGGT TTTACTCAAT 1260 

AAAAGGGGCA AGCTCATTAA AGAATACAGA ACCTTAGAGC CTTTAAAAAG CCTAGAAATC 1320 

CGTTTGAGTG AAGCCCCCAT TGACAAACAC GATGATTTTT TATACCATAA GACCGCTTAT 13 80 

GCCCCTTTTT ATCAAAACGC TCGAGCGCTC ATTAAAAAAG GCGTTATTTT TGATGAAATC 14 40 

TTTTATAACC AGGATTTGGA ACTCACTGAG GGCGCTAGGA GTAATCTTAT TTTAGAAATC 15 00 

CACAACAGGC TTTTAACCCC TTATTTCAGC GTGGGCGCAT TAACCGGGAC GGGCGTTGTG 15 60 

GGGTTGTTAA AAAAAGGTCT TGTTGAGCAT GCCCCTTTAA AATT AC AAG A CTTGCAAAGA 16 20 

GCGGCTAAAA TCTATTGTAT TAACGCGCTA TACGGCTTAG TGGAGGTGGG AATAATAGGT 16 80 

TACCCAATGG AGCAAAAAAG TTAA 17 04 

(2) INFORMATION FOR SEQ ID NO: 1004: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 
{ B ) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc.feature 

(B) LOCATION 1...1125 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004: 



ATGAAAATCA 


GTGTTAGTAA 


AAACGATTTA 


GAAAACACTT 


TGCGCTACTT 


GCAAGCTTTT 


60 


TTGGATAAAA 


AGG AC GCTTC 


CTCTATCGCT 


TCACACATCC 


ATTTAGAAGT 


CATTAAAGAA 


120 


AAGCTC TTTT 


TAAAAGCCAG 


CGATTCAGAT 


ATTGGACTAA 


AAAGCTATAT 


TTCTACGCAA 


180 


TCTACCGATA 


AAGAGGGCGT 


AGGCACGATC 


AACGGGAAGA 


AGTTTTTAGA 


TATTATCTCA 


240 


TGTTTGAAAG 


ACTCTAATAT 


TGTTTTAGAG 


ACTAAAGATG 


ACAGCTTGGT 


GATCAAACAA 


300 


AATAAAAGCT 


CTTTCAAACT 


CCCCATGTTT 


GACGCTGATG 


AGTTCCCTGA 


ATTCCCTGTT 


360 


ATTGATCCAA 


AAGTGAGTTT 


AGAAATCAAT 


GCCCCCTTTT 


TGGTGGATGC 


GTTTAAAAAG 


420 


ATCGCTCCTG 


TGATTGAGCA 


AACTAGCCAT 


AAAAGGGAAT 


TAGCCGGTGT 


TTTAATGCAA 


480 


TTCAATCAAA 


AACACCAAAC 


CCTTTCAGTG 


GTAGGCACGG 


ATACCAAGCG 


GCTCTCTTAC 


540 


ACGCAGTTAG 


AAAAAATCTC 


TATCCATTCC 


ACCGAAGAAG 


ACATCTCTTG 


CATTTTGCCT 


600 


AAAAGGGCTT 


TATTGGAAAT 


CCTTAAACTT 


TTTTATGAAA 


ATTTCAGTTT 


TAAAAGCGAC 


660 


GGCATGTTAG 


CGGTGGTTGA 


AAACGAAACG 


CACGCTTTTT 


TCACCAAGCT 


CATTGATGGG 


720 


AATTACCCTG 


ATTATCAAAA 


AATCCTCCCT 


AAAGAATACA 


CTTCTTCTTT 


CACTTTAGGC 


780 


AAGGAAGAAT 


TTAAAGAGGG 


CATTAAGTTG 


TGCAGTTCTT 


TAAGCTCCAC 


CATTAAGCTC 


840 


ACTTTAGAAA 


AAAACAACGC 


TTTGTTTGAA 


TCTTTGGATT 


C TGAGC AT AG 


CGAAACCGCC 


900 


AAAACCTCTG 


TTGAGATTGA 


AAAAGGTTTG 


GATATTGAAA 


AAGCCTTTCA 


TTTGGGCGTG 


960 


AACGCAAAAT 


TTTTC CTTG A 


AGCCTTAAAC 


GCTTTAGGGA 


CAACGCAATT 


TGTTTTAAAA 


1020 


TGCAATGAGC 


CTTCTTCGCC 


TTTTTTGATT 


CAAGAACCTC 


TTGATGAAAA 


GCAAAGCCAC 


1080 


TTGAACGCTA 


AAATTTCCAC 


TTTGATGATG 


CCAATCACAC 


TATAA 




1125 



(2) INFORMATION FOR SEQ ID NO: 1005: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 
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CLAIMS 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori polypeptide selected from the group consisting of SEQ ID NO: 492-SEQ ID 

5 NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765-SEQ ID NO: 818, SEQ 
ID NO: 820-SEQ ID NO: 846, SEQ ID NO: 848-SEQ ID NO: 896, SEQ ID NO: 898- 
SEQ ID NO: 963, SEQ ID NO: 966-SEQ ID NO: 982, SEQ ID NO: 1037, SEQ ID NO: 
1038, SEQ ID NO: 1041-SEQ ID NO: 1087, SEQ ID NO.1090 and SEQ ID NO: 1296- 
SEQ ID NO: 1298. 

10 

2. A recombinant expression vector comprising the nucleic acid of claim 1 
operably linked to a transcription regulatory element. 

3. A cell comprising a recombinant expression vector of claim 2. 

15 

4. A method for producing an H. pylori polypeptide comprising culturing a 
cell of claim 3 under conditions that permit expression of the polypeptide. 

5. A probe comprising a nucleotide sequence consisting of at least 8 

20 nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274-SEQ ID NO. 
327, SEQ ID NO: 329-SEQ ID NO: 364, SEQ ID NO: 366-SEQ ID NO: 405, SEQ ID 
NO: 407-SEQ ID NO: 472, SEQ ID NO: 475-SEQ ID NO: 491, SEQ ID NO: 983, SEQ 
ID NO: 984, SEQ ID NO: 987-SEQ ID NO: 1033, SEQ ID NO: 1036 and SEQ ID NO: 

25 1293-SEQ ID NO: 1295 or the complement thereof. 

6. An isolated nucleic acid comprising a nucleotide sequence of at least 8 
nucleotides in length, wherein the sequence is hybridizable to a nucleic acid having a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 

30 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274-SEQ ID NO: 327, SEQ ID 
NO: 329-SEQ ID NO: 364, SEQ ID NO: 366-SEQ ID NO: 405, SEQ ID NO: 407-SEQ 
ID NO: 472, SEQ ID NO: 475-SEQ ID NO: 491, SEQ ID NO: 983, SEQ ID NO: 984, 
SEQ ID NO: 987-SEQ ID NO: 1033, SEQ ID NO: 1036 and SEQ ID NO: 1293-SCQ ID 
NO: 1295 or the complement thereof. 

35 
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7. A vaccine composition for prevention or treatment of an H. pylori 
infection comprising an effective amount of an isolated nucleic acid comprising a 
nucleotide sequence encoding an H pylori polypeptide or a fragment thereof, said 
nucleic acid comprising a nucleotide sequence selected from the group consisting of 
SEQ ID NO: 1-SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 
274-SEQ ID NO: 327, SEQ ID NO: 329-SEQ ID NO: 364, SEQ ID NO: 366-SEQ ID 
NO: 405, SEQ ID NO: 407-SEQ ID NO: 472, SEQ ID NO: 475-SEQ ID NO: 491, SEQ 
ID NO: 983, SEQ ID NO: 984, SEQ ID NO: 987-SEQ ID NO: 1033, SEQ ID NO: 1036 
and SEQ ID NO: 1293-SEQ ID NO: 1295. 

8. A vaccine composition of claim 7, further comprising a pharmaceutically 
acceptable carrier. 

9. A vaccine composition of claim 8, wherein the pharmaceutically 
1 5 acceptable carrier is an adjuvant. 

10. A method of treating a subject for H. pylori infection comprising 
administering to a subject a vaccine composition of claim 7, such that treatment of H. 
pylori infection occurs. 

20 

11. A method of claim 10, wherein the treatment is a prophylactic treatment. 

12. A method of claim 10, wherein the treatment is a therapeutic treatment. 

25 13. A method for detecting the presence of a Helicobacter nucleic acid in a 

sample comprising: 

(a) contacting a sample with a probe of claim 5 under conditions in 
which a hybrid can form between the probe and a Helicobacter nucleic acid in the 
sample; and 

30 (b) detecting the hybrid formed in step (a), wherein detection of a 

hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 

14. A recombinant or substantially pure preparation of an H. pylori 
polypeptide selected from the group consisting of SEQ ID NO: 492-SEQ ID NO: 759, 
35 SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765-SEQ ID NO: 818, SEQ ID NO: 
820-SEQ ID NO: 846, SEQ ID NO: 848-SEQ ID NO: 896, SEQ ID NO: 898-SEQ ID 
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NO: 963, SEQ ID NO: 966-SEQ ID NO: 982, SEQ ID NO: 1037. SEQ ID NO: 1038, 
SEQ ID NO: 1 04 1 -SEQ ID NO: 1 087, SEQ ID NO: 1 090 and SEQ ID NO: 1 296-SEQ 
ID NO: 1298. 

5 1 5. A vaccine composition for prevention or treatment of an H. pylori 

infection comprising an effective amount of a purified H. pylori polypeptide or a 
fragment thereof, wherein said H. pylori polypeptide is selected from the group 
consisting of SEQ ID NO: 492-SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, 
SEQ ID NO: 765-SEQ ID NO: 8 1 8, SEQ ID NO: 820-SEQ ID NO: 846, SEQ ID NO: 

1 0 848-SEQ ID NO: 896, SEQ ID NO: 898-SEQ ID NO: 963, SEQ ID NO: 966-SEQ ID 
NO: 982, SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041-SEQ ID NO: 1087, 
SEQ ID NO:1090 and SEQ ID NO: 1296-SEQ ID NO: 1298. 

16. A vaccine composition of claim 1 5, further comprising a 
15 pharmaceutical^ acceptable carrier. 

17. A vaccine composition of claim 16, wherein the pharmaceutically 
acceptable carrier is an adjuvant. 

20 18. A method of treating a subject for H. pylori infection comprising 

administering to a subject a vaccine composition of claim 15, such that treatment ofH. 
pylori infection occurs. 

19. A method of claim 1 8, wherein the treatment is a prophylactic treatment. 

25 

20. A method of claim 1 8, wherein the treatment is a therapeutic treatment. 

21 . An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cell envelope polypeptide or a fragment thereof, said nucleic acid selected 

30 from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, 

SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 
294, SEQ ID NO: 299, SEQ ID NO. 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID 
NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ 
ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, 

35 SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 
402, SEQ ID NO: 404, SEQ ID NO. 409, SEQ ID NO: 410, SEQ ID NO: 412. SEQ ID 
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NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ 
ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, 
SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 
989, SEQ ID NO: 1008, SEQ ID NO: 101 1, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ 
5 ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 
326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID 
NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, 
SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 
437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID 

10 NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, 
SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 
1010, SEQ ID NO: 1012, SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ 
ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, 
SEQ ID NO: 41 1, SEQ ID NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 

15 417, SEQ ID NO: 430, SEQ ID NO: 992, SEQ ID NO: 1025, SEQ ID NO: 477, SEQ ID 
NO: 414, SEQ ID NO: 253, SEQ ID NO: 293, SEQ ID NO: 334, SEQ ID NO: 343, SEQ 
ID NO: 418, SEQ ID NO: 424, and SEQ ID NO: 443. 

22. The purified nucleic acid of claim 21 , wherein said H. pylori cell 

20 envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof encoded by the nucleic acid selected from the group consisting of 
SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 
280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID 
NO: 3 1 1 , SEQ ID NO: 3 1 2, SEQ ID NO: 313, SEQ ID NO: 32 1 , SEQ ID NO: 327, SEQ 

25 ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, 
SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 
392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID 
NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ 
ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, 

30 SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 
468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID 
NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 
1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID 
NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ 

35 ID NO: 1009, SEQ ID NO. 1023, SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 
319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID 
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NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO. 996, SEQ ID NO: 1027, 
SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 
1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 1010, and SEQ ID NO: 1012. 

5 23. he purified nucleic acid of claim 22, wherein said H. pylori outer 

membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof encoded by the nucleic acid 
selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 
266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID 

10 NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ 
ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, 
SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 
384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID 
NO: 402, SEQ ID NO: 404, SEQ ID NO. 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ 

1 5 ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441 , SEQ ID NO: 444, 
SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 
453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID 
NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, 
SEQ ID NO: 1029, and SEQ ID NO: 1032. 

20 

24. The purified nucleic acid of claim 22, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a C- 
terminal tyrosine cluster or a fragment thereof encoded by the nucleic acid selected from 
the group consisting of SEQ ID NO: 286, SEQ ID NO. 326, SEQ ID NO: 374, SEQ ID 

25 NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ 
ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, and SEQ ID NO: 1295. 

25. The purified nucleic acid of claim 22, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 

30 terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 319, 
SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 
447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, and 
SEQ ID NO: 1031. 

35 
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26. The purified nucleic acid of claim 21 , wherein said //. pylori cell 
envelope polypeptide or a fragment thereof is an K pylori inner membrane polypeptide 
or a fragment thereof encoded by the nucleic acid selected from the group consisting of 
SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 

5 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, SEQ ID NO: 411, SEQ ID 
NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 417, SEQ ID NO: 430, 
SEQ ID NO: 992, and SEQ ID NO: 1025. 

27. The purified nucleic acid of claim 26, wherein said H. pylori inner 
10 membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in 

outer membrane and cell wall synthesis or a fragment thereof encoded by the nucleic 
acid comprising a nucleotide sequence of SEQ ID NO: 354. 

28. The purified nucleic acid of claim 26, wherein said H. pylori inner 

1 5 membrane polypeptide or a fragment thereof is an K pylori polypeptide involved in 
energy conversion or a fragment thereof encoded by the nucleic acid selected from the 
group consisting of SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421 , and SEQ ID 
NO: 1022. 

20 29. The purified nucleic acid of claim 26, wherein said H. pylori inner 

membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in 
cofactor metabolism or a fragment thereof encoded by the nucleic acid comprising a 
nucleotide sequence of SEQ ID NO: 463. 

25 30. The purified nucleic acid of claim 26, wherein said H, pylori inner 

membrane polypeptide or a fragment thereof is an K pylori polypeptide involved in 
secretion and adhesion or a fragment thereof encoded by the nucleic acid selected from 
the group consisting of SEQ ID NO: 281 and SEQ ID NO: 988. 

30 31. The purified nucleic acid of claim 26, wherein said K pylori inner 

membrane polypeptide or a fragment thereof is an K pylori polypeptide involved in 
transport or a fragment thereof encoded by the nucleic acid selected from the group 
consisting of SEQ ID NO: 407 and SEQ ID NO: 1017. 

35 32. The purified nucleic acid of claim 21, wherein said H. pylori cell 

envelope polypeptide or a fragment thereof is an H. pylori flagellar polypeptide or a 
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fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ 
ID NO: 477. 

33. The purified nucleic acid of claim 21 , wherein said H. pylori cell 

5 envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a 
fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ 
ID NO: 414. 

34. A recombinant expression vector comprising the nucleic acid of claim 21 
1 0 operably linked to a transcription regulatory element. 

35. A cell comprising a recombinant expression vector of claim 34. 

36. A method for producing an H. pylori polypeptide comprising culturing a 
1 5 cell of claim 35 under conditions that permit expression of the polypeptide. 

37. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori secreted polypeptide or a fragment thereof, said nucleic acid selected from the 
group consisting of SEQ ID NO: 355, SEQ ID NO: 1006, SEQ ID NO: 257, SEQ ID 

20 NO: 258, SEQ ID NO: 260, SEQ ID NO: 261 , SEQ ID NO: 264, SEQ ID NO: 265, SEQ 
ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274, SEQ ID NO: 275, 
SEQ ID NO: 276, SEQ ID NO: 279, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 
287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 295, SEQ ID 
NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 300, SEQ ID NO: 301 , SEQ 
25 ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 314, 
SEQ ID NO: 315, SEQ ID NO: 323, SEQ ID NO: 338, SEQ ID NO: 342, SEQ ID NO: 
348, SEQ ID NO: 349, SEQ ID NO: 356, SEQ ID NO: 358, SEQ ID NO: 359, SEQ ID 
NO: 360, SEQ ID NO: 361 , SEQ ID NO: 362, SEQ ID NO: 363, SEQ ID NO: 367, SEQ 
ID NO: 370, SEQ ID NO: 371 , SEQ ID NO: 373, SEQ ID NO: 377, SEQ ID NO: 378, 
30 SEQ ID NO: 379, SEQ ID NO: 380, SEQ ID NO: 388, SEQ ID NO: 390, SEQ ID NO: 
394, SEQ ID NO: 395, SEQ ID NO: 396, SEQ ID NO: 401, SEQ ID NO: 403, SEQ ID 
NO: 405, SEQ ID NO: 408, SEQ ID NO: 420, SEQ ID NO: 426, SEQ ID NO: 428, SEQ 
ID NO: 429, SEQ ID NO: 432, SEQ ID NO: 439, SEQ ID NO: 442, SEQ ID NO: 451 , 
SEQ ID NO: 471 , SEQ ID NO: 478, SEQ ID NO: 488, SEQ ID NO: 987, SEQ ID NO: 
35 990, SEQ ID NO: 991, SEQ ID NO: 993, SEQ ID NO: 1001, SEQ ID NO: 1002, SEQ 
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IDNO: 1007, SEQ ID NO: 1013, SEQ ID NO: 1016, SEQ ID NO: 1018, SEQ ID NO: 
1021, and SEQ ID NO: 1026. 

38. The purified nucleic acid of claim 37, wherein said H. pylori secreted 

5 polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and 
adhesion or a fragment thereof encoded by the nucleic acid selected from the group 
consisting of SEQ ID NO: 355 and SEQ ID NO: 1006. 

39. A recombinant expression vector comprising the nucleic acid of claim 38 
10 operably linked to a transcription regulatory element. 

40. A cell comprising a recombinant expression vector of claim 39. 

41 . A method for producing an H. pylori polypeptide comprising culturing a 
15 cell of claim 40 under conditions that permit expression of the polypeptide. 

42. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cytoplasmic polypeptide or a fragment thereof, said nucleic acid selected from 
the group consisting of SEQ ID NO: 470, SEQ ID NO: 1033, SEQ ID NO: 357, SEQ ID 

20 NO: 457, SEQ ID NO: 461, SEQ ID NO: 1030, SEQ ID NO: 345, SEQ ID NO: 383, 

SEQ ID NO: 387, SEQ ID NO: 455, SEQ ID NO: 1003, SEQ ID NO: 351, SEQ ID NO: 
416, SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID 
NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 1000, SEQ ID NO: 1004, 
SEQ ID NO: 1020, SEQ ID NO: 1293, SEQ ID NO: 318, SEQ ID NO: 322, SEQ ID 

25 NO: 324, SEQ ID NO: 330, SEQ ID NO: 347, SEQ ID NO: 440, SEQ ID NO: 446, SEQ 
ID NO: 464, SEQ ID NO: 490, SEQ ID NO: 491, SEQ ID NO: 995, SEQ ID NO: 997, 
SEQ ID NO: 1005, SEQ ID NO: 1028. 

43. The purified nucleic acid of claim 42, wherein said H. pylori cytoplasmic 
30 polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy 

conversion or a fragment thereof encoded by the nucleic acid selected from the group 
consisting of SEQ ID NO: 470 and SEQ ID NO: 1033. 

44. The purified nucleic acid of claim 42, wherein said H. pylori cytoplasmic 
35 polypeptide or a fragment thereof is an K pylori polypeptide involved in amino acid 



bnsoocid- <yyo 9737044A1 i > BNS page 1119 



WO 97/37044 



PCT/US97/05223 



- 1118- 

metabolism and transport or a fragment thereof encoded by the nucleic acid selected 
from the group consisting of SEQ ID NO: 357 and SEQ ID NO: 457. 

45. The purified nucleic acid of claim 42, wherein said K pylori cytoplasmic 
5 polypeptide or a fragment thereof is an K pylori polypeptide involved in nucleotide 

metabolism and transport or a fragment thereof encoded by the nucleic acid selected 
from the group consisting of SEQ ID NO: 461 and SEQ ID NO: 1030. 

46. The purified nucleic acid of claim 42, wherein said H. pylori cytoplasmic 
10 polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor 

metabolism or a fragment thereof encoded by the nucleic acid selected from the group 
consisting of SEQ ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, 
and SEQ ID NO: 1003. 

1 5 47. The purified nucleic acid of claim 42, wherein said K pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid 
metabolism or a fragment thereof encoded by the nucleic acid selected from the group 
consisting of SEQ ID NO: 351 and SEQ ID NO: 416. 

20 48. The purified nucleic acid of claim 42, wherein said K pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in genome 
replication, transcription, recombination and repair or a fragment thereof encoded by the 
nucleic acid selected from the group consisting of SEQ ID NO; 278, SEQ ID NO: 335, 
SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 

25 472, SEQ ID NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, and SEQ ID NO: 1293. 

49. A recombinant expression vector comprising the nucleic acid of claim 42 
operably linked to a transcription regulatory element. 

30 50. A cell comprising a recombinant expression vector of claim 49. 

51. A method for producing an H. pylori polypeptide comprising culturing a 
cell of claim 50 under conditions that permit expression of the polypeptide. 

35 52. An isolated nucleic acid comprising a nucleotide sequence encoding an 

H. pylori cellular polypeptide or a fragment thereof, said nucleic acid selected from the 
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group consisting of SEQ ID NO: 256, SEQ ID NO: 267, SEQ ID NO: 282, SEQ ID NO: 
306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID 
NO: 316, SEQ ID NO: 317, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 336, SEQ 
ID NO: 337, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 344, 
5 SEQ ID NO: 369, SEQ ID NO: 376, SEQ ID NO: 382, SEQ ID NO: 386, SEQ ID NO: 
423, SEQ ID NO: 431, SEQ ID NO: 435, SEQ ID NO: 436, SEQ ID NO: 458, SEQ ID 
NO: 462, SEQ ID NO: 475, SEQ ID NO: 476, SEQ ID NO: 479, SEQ ID NO: 480, SEQ 
ID NO: 481, SEQ ID NO: 482, SEQ ID NO: 483, SEQ ID NO: 484, SEQ ID NO: 485, 
SEQ ID NO: 486, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 984, SEQ ID NO: 
10 994, SEQ ID NO: 1024, and SEQ ID NO: 1036. 

53. A recombinant expression vector comprising the nucleic acid of claim 52 
operably linked to a transcription regulatory element. 

15 54. A cell comprising a recombinant expression vector of claim 53. 

55. A method for producing an H. pylori polypeptide comprising culturing a 
cell of claim 54 under conditions that permit expression of the polypeptide. 

20 56. A vaccine composition for prevention or treatment of an H. pylori 

infection comprising an effective amount of a nucleic acid of claim 21 . 

57. A vaccine composition of claim 56, further comprising a 
pharmaceutically acceptable carrier. 

25 

58. A vaccine composition of claim 57, wherein the pharmaceutically 
acceptable carrier is an adjuvant. 

59. A method of treating a subject for H. pylori infection comprising 

30 administering to a subject a vaccine composition of claim 56, such that treatment of H. 
pylori infection occurs. 

60. A method of claim 59, wherein the treatment is a prophylactic treatment. 
35 61 . A method of claim 59, wherein the treatment is a therapeutic treatment. 
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62. A vaccine composition for prevention or treatment of an H. pylori 
infection comprising an effective amount of a nucleic acid of claim 37. 

63. A vaccine composition of claim 62, further comprising a 
5 pharmaceutical^ acceptable carrier. 

64. A vaccine composition of claim 63, wherein the pharmaceutical^ 
acceptable carrier is an adjuvant. 

10 65. A method of treating a subject for K pylori infection comprising 

administering to a subject a vaccine composition of claim 62, such that treatment of H. 
pylori infection occurs. 

66. A method of claim 65, wherein the treatment is a prophylactic treatment. 

15 

67. A method of claim 65, wherein the treatment is a therapeutic treatment. 

68. A vaccine composition for prevention or treatment of an H. pylori 
infection comprising an effective amount of a nucleic acid of claim 42. 

20 

69. A vaccine composition of claim 68, further comprising a 
pharmaceutical^ acceptable carrier. 

70. A vaccine composition of claim 69, wherein the pharmaceutical ly 
25 acceptable carrier is an adjuvant. 

71. A method of treating a subject for H. pylori infection comprising 
administering to a subject a vaccine composition of claim 68, such that treatment of H. 
pylori infection occurs. 

30 

72. A method of claim 71, wherein the treatment is a prophylactic treatment. 

73. A method of claim 71, wherein the treatment is a therapeutic treatment. 

35 74. A vaccine composition for prevention or treatment of an H. pylori 

infection comprising an effective amount of a nucleic acid of claim 52. 
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75. A vaccine composition of claim 74, further comprising a 
pharmaceutically acceptable carrier. 

5 76. A vaccine composition of claim 75, wherein the pharmaceutically 

acceptable carrier is an adjuvant. 

77. A method of treating a subject for H. pylori infection comprising 
administering to a subject a vaccine composition of claim 74, such that treatment of H. 

10 pylori infection occurs. 

78. A method of claim 77, wherein the treatment is a prophylactic treatment. 

79. A method of claim 77, wherein the treatment is a therapeutic treatment. 

15 

80. A purified H. pylori cell envelope polypeptide or a fragment thereof, 
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 746, 
SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 
776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID 

20 NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ 
ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, 
SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 
888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID 
NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ 

25 ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO. 941 , 
SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 
960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO. 1062, SEQ ID NO: 1065, SEQ 
ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 
750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID 

30 NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, 
SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID 
NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ 
ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, SEQ ID NO: 
1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ 

35 ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, SEQ ID NO: 1066, SEQ ID NO: 

845, SEQ ID NO: 863, SEQ ID NO: 891 , SEQ ID NO. 912, SEQ ID NO: 1076, SEQ ID 
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NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, 
SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 
1046, SEQ ID NO: 1079, SEQ ID NO: 968, SEQ ID NO: 905, SEQ ID NO. 744, SEQ 
ID NO: 784, SEQ ID NO: 825, SEQ ID NO: 834, SEQ ID NO: 909, SEQ ID NO: 915, 
5 and SEQ ID NO: 934. 

8 1 . The purified polypeptide of claim 80, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 746, SEQ ID 

10 NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ 
ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, 
SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 
882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID 
NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ 

1 5 ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901 , 
SEQ ID NO: 903, SEQ ID NO: 91 8, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 
932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941 , SEQ ID 
NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ 
ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO. 

20 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ 
ID NO. 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, 
SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO. 1052, SEQ ID NO: 1063, SEQ ID 
NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, 
SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 

25 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1 081 , SEQ ID NO: 1085, SEQ 
ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, 
SEQ ID NO: 880, SEQ ID NO: 1064, and SEQ ID NO: 1066. 

82. The purified polypeptide of claim 81, wherein said H. pylori outer 
30 membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 

terminal phenylalanine residue or a fragment thereof selected from the group consisting 
of SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID 
NO. 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ 
ID NO: 802, SEQ ID NO. 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, 
35 SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 
857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO. 875, SEQ ID NO: 882, SEQ ID 
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NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ 
ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, 
SEQ ID NO. 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 
940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID 
5 NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, 
SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, and SEQ 
ID NO: 1086. 



83. The purified polypeptide of claim 81, wherein said H. pylori outer 
1 0 membrane polypeptide or a fragment thereof is an H. pylori polypeptide a C-terminal 

tyrosine cluster or a fragment thereof selected from the group consisting of SEQ ID NO: 
746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID 
NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ 
ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, 
1 5 SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 
859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID 
NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ 
ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, 
SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 
20 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID 
NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1 062, SEQ ID NO: 1065, 
SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, and SEQ ID NO: 1086. 



84. The purified polypeptide of claim 81, wherein said H. pylori outer 
25 membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 

terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
selected from the group consisting of SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 
916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID 
NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, and SEQ ID NO: 1085. 

30 

85. The purified polypeptide of claim 80, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 845, SEQ ID 
NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, 

35 SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 
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1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 1046, SEQ 
ID NO: 1079. 

86. The purified polypeptide of claim 85, wherein said K pylori inner 

5 membrane polypeptide or a fragment thereof is an K pylori polypeptide involved in 
outer membrane and cell wall synthesis or a fragment thereof comprising an amino acid 
sequence of SEQ ID NO: 845. 

87. The purified polypeptide of claim 85, wherein said K pylori inner 
10 membrane polypeptide or a fragment thereof is an H, pylori polypeptide involved in 

energy conversion or a fragment thereof selected from the group consisting of SEQ ID 
NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, and SEQ ID NO: 1076. 

88. The purified polypeptide of claim 85, wherein said K pylori inner 

1 5 membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in 
cofactor metabolism or a fragment thereof comprising an amino acid sequence of SEQ 
ID NO: 954. 

89. The purified polypeptide of claim 85, wherein said H. pylori inner 
20 membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in 

secretion and adhesion or a fragment thereof selected from the group consisting of SEQ 
ID NO: 772 and SEQ ID NO: 1042. 

90. The purified polypeptide of claim 85, wherein said H. pylori inner 
25 membrane polypeptide or a fragment thereof is an K pylori polypeptide involved in 

transport or a fragment thereof selected from the group consisting of SEQ ID NO: 898 
and SEQ ID NO: 1071. 

91 . The purified polypeptide of claim 80, wherein said H. pylori cell 

30 envelope polypeptide or a fragment thereof is an H. pylori flagellar polypeptide or a 
fragment thereof comprising an amino acid sequence of SEQ ID NO: 968. 

92. The purified polypeptide of claim 80, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a 

35 fragment thereof comprising an amino acid sequence of SEQ ID NO: 905. 
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93. A purified H. pylori cellular polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 747, SEQ ID NO: 
758, SEQ ID NO: 773, SEQ ID NO: 797, SEQ ID NO: 798, SEQ ID NO: 799, SEQ ID 
NO: 800, SEQ ID NO: 801, SEQ ID NO: 807, SEQ ID NO: 808, SEQ ID NO: 823, SEQ 

5 ID NO: 824, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID NO: 830, SEQ ID NO: 83 1 , 
SEQ ID NO: 832, SEQ ID NO: 835, SEQ ID NO: 860, SEQ ID NO: 867, SEQ ID NO: 
873, SEQ ID NO: 877, SEQ ID NO: 914, SEQ ID NO: 922, SEQ ID NO: 926, SEQ ID 
NO: 927, SEQ ID NO: 949, SEQ ID NO: 953, SEQ ID NO: 966, SEQ ID NO: 967, SEQ 
ID NO: 970, SEQ ID NO: 971, SEQ ID NO: 972, SEQ ID NO: 973, SEQ ID NO: 974, 
1 0 SEQ ID NO: 975, SEQ ID NO: 976, SEQ ID NO: 977, SEQ ID NO: 978, SEQ ID NO: 
980, SEQ ID NO: 1038, SEQ ID NO: 1048, SEQ ID NO: 1078, and SEQ ID NO: 1090. 

94. A purified H. pylori secreted polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 846, SEQ ID NO: 

15 1060, SEQ ID NO: 748, SEQ ID NO: 749, SEQ ID NO: 751, SEQ ID NO: 752, SEQ ID 
NO: 755, SEQ ID NO: 756, SEQ ID NO: 759, SEQ ID NO: 761 , SEQ ID NO: 763, SEQ 
ID NO: 765, SEQ ID NO: 766, SEQ ID NO: 767, SEQ ID NO: 770, SEQ ID NO: 774, 
SEQ ID NO: 775, SEQ ID NO: 778, SEQ ID NO: 779, SEQ ID NO: 780, SEQ ID NO: 
782, SEQ ID NO: 786, SEQ ID NO: 787, SEQ ID NO: 788, SEQ ID NO: 789, SEQ ID 

20 NO: 791 , SEQ ID NO: 792, SEQ ID NO: 793, SEQ ID NO: 794, SEQ ID NO: 795, SEQ 
ID NO: 796, SEQ ID NO: 805, SEQ ID NO: 806, SEQ ID NO: 814, SEQ ID NO: 829, 
SEQ ID NO: 833, SEQ ID NO: 839, SEQ ID NO: 840, SEQ ID NO: 849, SEQ ID NO: 
850, SEQ ID NO: 851, SEQ ID NO: 852, SEQ ID NO: 853, SEQ ID NO: 854, SEQ ID 
NO: 858, SEQ ID NO: 861, SEQ ID NO: 862, SEQ ID NO: 864, SEQ ID NO: 868, SEQ 

25 ID NO: 869, SEQ ID NO: 870, SEQ ID NO: 871 , SEQ ID NO: 879, SEQ ID NO: 881 , 
SEQ ID NO: 885, SEQ ID NO: 886, SEQ ID NO: 887, SEQ ID NO: 892, SEQ ID NO: 
894, SEQ ID NO: 896, SEQ ID NO: 899, SEQ ID NO: 91 1, SEQ ID NO: 917, SEQ ID 
NO: 919, SEQ ID NO: 920, SEQ ID NO: 923, SEQ ID NO: 930, SEQ ID NO: 933, SEQ 
ID NO: 942, SEQ ID NO: 962, SEQ ID NO: 969, SEQ ID NO: 979, SEQ ID NO: 1041, 

30 SEQ ID NO: 1044, SEQ ID NO: 1045, SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID 
NO: 1056, SEQ ID NO: 1061, SEQ ID NO: 1067, SEQ ID NO: 1070, SEQ ID NO: 
1072, SEQ ID NO: 1075, and SEQ ID NO: 1080. 

95. The purified polypeptide of claim 94, wherein said H. pylori secreted 
35 polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and 
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adhesion or a fragment thereof selected from the group consisting of SEQ ID NO: 846 
and SEQ ID NO: 1060, 

96. A purified H. pylori cytoplasmic polypeptide or a fragment thereof, 
5 wherein said polypeptide is selected from the group consisting of SEQ ID NO: 961, 

SEQ ID NO: 1087, SEQ ID NO: 848, SEQ ID NO: 948, SEQ ID NO: 952, SEQ ID NO: 
1084, SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, SEQ ID 
NO: 1057, SEQ ID NO: 842, SEQ ID NO: 907, SEQ ID NO: 769, SEQ ID NO: 826, 
SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 
10 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, SEQ ID NO: 1296, SEQ 
ID NO: 809, SEQ ID NO: 813, SEQ ID NO: 815, SEQ ID NO: 821, SEQ ID NO: 838, 
SEQ ID NO: 93 1 , SEQ ID NO: 937, SEQ ID NO: 955, SEQ ID NO: 98 1 , SEQ ID NO: 
982, SEQ ID NO: 1049, SEQ ID NO: 1051, SEQ ID NO: 1059, and SEQ ID NO: 1082. 

15 97. The purified polypeptide of claim 96, wherein said H. pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy 
conversion or a fragment thereof selected from the group consisting of SEQ ID NO: 961 
and SEQ ID NO: 1087. 

20 98. The purified polypeptide of claim 96, wherein said K pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in amino acid 
metabolism and transport or a fragment thereof selected from the group consisting of 
SEQ ID NO: 848 and SEQ ID NO: 948. 

25 99. The purified polypeptide of claim 96, wherein said H, pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in nucleotide 
metabolism and transport or a fragment thereof selected from the group consisting of 
SEQ ID NO: 952 and SEQ ID NO: 1084. 

30 1 00. The purified polypeptide of claim 96, wherein said H. pylori cytoplasmic 

polypeptide or a fragment thereof is an K pylori polypeptide involved in cofactor 
metabolism or a fragment thereof selected from the group consisting of SEQ ID NO: 
836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, and SEQ ID NO: 1057. 

35 ioi. The purified polypeptide of claim 96, wherein said H. pylori cytoplasmic 

polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid 
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metaboiism or a fragment thereof selected from the group consisting of SEQ ID NO: 
842 and SEQ ID NO: 907. 

102. The purified polypeptide of claim 96, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an K pylori polypeptide involved in genome 
replication, transcription, recombination and repair or a fragment thereof selected from 
the group consisting of SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID 
NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, 
SEQ ID NO: 1058, SEQ ID NO: 1074, and SEQ ID NO: 1296. 

103. A vaccine composition for prevention or treatment of an H. pylori 
infection comprising an effective amount of an K pylori polypeptide of claim 80. 

104. A vaccine composition of claim 103, further comprising a 
1 5 pharmaceutical ly acceptable carrier. 

105. A vaccine composition of claim 104, wherein the pharmaceutically 
acceptable carrier is an adjuvant. 

20 1 06. A method of treating a subject for H. pylori infection comprising 

administering to a subject a vaccine composition of claim 103, such that treatment of H. 
pylori infection occurs. 

107. A method of claim 106, wherein the treatment is a prophylactic 
25 treatment. 

108. A method of claim 106, wherein the treatment is a therapeutic treatment. 

109. A vaccine composition for prevention or treatment of an H. pylori 
30 infection comprising an effective amount of an K pylori polypeptide of claim 93. 

110. A vaccine composition of claim 109, further comprising a 
pharmaceutically acceptable carrier. 

35 111. A vaccine composition of claim 1 1 0, wherein the pharmaceutically 

acceptable carrier is an adjuvant. 
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112. A method of treating a subject for H. pylori infection comprising 
administering to a subject a vaccine composition of claim 1 09, such that treatment of H. 
pylori infection occurs. 

113. A method of claim 112, wherein the treatment is a prophylactic 
treatment. 



114. A method of claim 112, wherein the treatment is a therapeutic treatment. 

10 

115. A vaccine composition for prevention or treatment of an H. pylori 
infection comprising an effective amount of an K pylori polypeptide of claim 94. 



116. A vaccine composition of claim 1 1 5, further comprising a 
15 pharmaceutical^ acceptable carrier. 

117. A vaccine composition of claim 1 16, wherein the pharmaceutically 
acceptable carrier is an adjuvant. 

20 1 1 8. A method of treating a subject for K pylori infection comprising 

administering to a subject a vaccine composition of claim 115, such that treatment of H. 
pylori infection occurs. 

119. A method of claim 1 18, wherein the treatment is a prophylactic 
25 treatment. 



120. A method of claim 118, wherein the treatment is a therapeutic treatment. 



121. A vaccine composition for prevention or treatment of an H. pylori 
30 infection comprising an effective amount of an H. pylori polypeptide of claim 96. 

122. A vaccine composition of claim 121, further comprising a 
pharmaceutically acceptable carrier. 

35 123. A vaccine composition of claim 122, wherein the pharmaceutically 

acceptable carrier is an adjuvant. 
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1 24. A method of treating a subject for H pylori infection comprising 
administering to a subject a vaccine composition of claim 121 , such that treatment of H 
pylori infection occurs. 

125. A method of claim 124, wherein the treatment is a prophylactic 
treatment. 

126. A method of claim 124, wherein the treatment is a therapeutic treatment. 



127. A method for detecting the presence of a. Helicobacter nucleic acid in a 
sample comprising: 

(a) contacting a sample with a nucleic acid of claim 21 under 
conditions in which a hybrid can form between the probe and a Helicobacter nucleic 

1 S acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 

128. A method for detecting the presence of a Helicobacter nucleic acid in a 
20 sample comprising: 

(a) contacting a sample with a nucleic acid of claim 37 under 
conditions in which a hybrid can form between the probe and a Helicobacter nucleic 
acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
25 hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 

129. A method for detecting the presence of a Helicobacter nucleic acid in a 
sample comprising: . 

(a) contacting a sample with a nucleic acid of claim 42 under 

30 conditions in which a hybrid can form between the probe and a Helicobacter nucleic 
acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 
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130. A method for detecting the presence of a Helicobacter nucleic acid in a 
sample comprising: 

(a) contacting a sample with a nucleic acid of claim 52 under 
conditions in which a hybrid can form between the probe and a Helicobacter nucleic 

5 acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 
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